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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

In re Patent Application of 

NASSIFetal Atty. Ref.: 1721-13 

Serial No. Unknown Group: 

Filed: January 12, 1999 Examiner: 

For: DNA AND SPECIFIC PROTEINS OR PEPTIDES OF THE 
NEISSERIA MENINGITIDIS SPECIES BACTERIA, 
METHOD FOR OBTAINING THEM AND THEIR 
BIOLOGICAL APPLICATIONS 

*********** 

January 12, 1999 

Assistant Commissioner for Patents 
Washington, DC 20231 

Sir: 

PRELIMINARY AMENDMENT 

Prior to calculation of the filing fee and in order to place the above identified 
application in better condition for examination, please amend the claims as follows: 
IN THE CLAIMS 

1 . (Amended) DNAs, [characterized in that they are in all or part genes, with 
their] comprising a reading framef, present in] of Neisseria meningitidis ([called] Nm 
[below]), but absent both from Neisseria gonorrhoeae ([called] Ng [below]) and from 
Neisseria Pactamica [sic] ([called] N1 [below]), with the exception of genes involved in 
the biosynthesis of the polysaccharide capsule, frpA, frpC, opc, pork, rotamase, the 
sequence IC1 106 [sic], IgA proteases, pilin, pilC, proteins which bind transferrin and 
opacity proteins. 

Claim 2, line 1 , delete "characterized in that they" and insert - which are -. 
Claims 3, 4 and 5, lines 1 and 2 of each, delete "characterized in that they 
comprise" and insert - comprising -. 
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Claims 6 and 7, line 1 of each, delete "characterized in that their" and insert 
- comprising a --; and line 2 of each, after "sequence" insert - which --. 

Claim 8, line 1, delete "characterized in that they" and insert - which -. 

Claims 9 and 10, lines 1 and 2 of each, delete "characterized in that their 
sequence corresponds" and insert -- which correspond --. 

Claim 1 1 , line 1 , delete "characterized in that they" and insert - which -. 

Claims 12 and 13, lines 1 and 2 of each, delete "characterized in that they 
comprise" and insert - comprising -. 

Claim 14, lines 1 and 2, delete "any one of the preceding claims, characterized in 
that it" and insert - claim 1 which --. 

Claim 15, lines 1 and 2, delete "any only of claims 1 to 14, characterized in that" 
and insert - claim 1 wherein -. 

Claim 16, lines 1 and 2, delete "any one of claims 1 to 15, characterized in that 
it" and insert - claim 1 which --. 

17. (Amended) Host cell, [more particularly bacterial cell or Nm cell,] 
transformed by insertion of at least one DNA according to [any one of claims 1 to 15] 
claim 1 . 

18. (Amended) Cell comprising genes or gene fragments specific to Nm, [more 
particularly bacterial cell or Nm cell,] the chromosome of which is deleted by at least 
one DNA according to [any one of claims 1 to15,] claim 1 in particular a DNA 
responsible for the pathogenicity. 
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19. (Amended) DNAs, [characterized in that their sequence] which corresponds 
in all or part to the transcription of at least one DNA sequence or sequence fragment 
according to [any one of claims 1 to 15] claim 1 . 

20. (Amended) Antisense nucleic acids, [characterized in that their] which have 
a sequence [corresponds] corresponding to the antisense of at least one nucleotide 
sequence according to [any one of claims 1 to 15 or 19,] claim 1 or a fragment of such 
a sequence, and in that they carry, where appropriate, at least one chemical 
substituent, such as a methyl group and/or a glycosyl group. 

Claim 21, lines 3 and 4, delete "any one of claims 1 to 15 or 19," and insert 
-- claim 1 --. 

Claim 22, lines 2 through 4, delete more specifically peptides corresponding to 
all or part of those coded by a DNA according to claim 14". 
Claim 23, line 3, delete "20 or". 
Claim 26, line 2, delete "or 25". 

Claim 28, line 8, delete "one of claims 1 to 15 or 19" and insert - claim 1 -. 
Claim 29, lines 5 and 6, delete "any one of claims 21 or 22" and insert 
claim 21--. 

30. (Amended) Kits for carrying out a method according to [any one of claims 
28 or 29] claim 28 , characterized in that they comprise 

- at least one of said reagent [as defined in claim 28 or 29, that is to say of the 
nucleic acid, antibody or peptide type], 
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- products, in particular markers or buffers, which enable the intended nucleotide 
hybridization reaction or immunological reaction to be carried out, as well as use 
instructions. 

31. (Amended) Vaccine composition including in its spectrum, in particular in 
combination with at least one childhood vaccine, antimeningococcal prophylaxis and 
intended for prevention of any form of infection by Neisseria meningitidis, characterized 
in that it comprises, in combination with (a) physiologically acceptable vehicle(s), an 
effective amount: 

- of peptide according to claim 21 [or 22], or 

- of an antibody or anti-antibody fragment [according to claim 23] thereto , 

this material optionally being conjugated, in order to reinforce its immunogenicity, 
with a carrier molecule such as a poliovirus protein, tetanus toxin, protein produced by 
the hypervariable region of a pilin. 

32. (Amended) Vaccine composition including in its spectrum, in particular in 
combination with at least one childhood vaccine, antimeningococcal prophylaxis and 
intended for prevention of any form of infection by Neisseria meningitidis, characterized 
in that it comprises, in combination with (a) physiologically acceptable vehicle(s), an 
effective amount: 

- of nucleic acids according to [any one of claims 1 to 15 or 19] claim 1 or 

- of cells [according to claim 17 or 18] containing same . 
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REMARKS 



The above amendments are made to place the claims in a more traditional 
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1 100 North Glebe Road, 8th Floor 
Arlington, VA 22201-4714 
Telephone: (703) 816-4000 
Facsimile: (703)816-4100 



format. 



Respectfully submitted, 
NIXON & VANDERHYE P.C. 




B.J. Sadoff 
Reg. No. 36,663 
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_DNAs and proteins or pepti des__s^ci f ic ^ to^bacteria of the 
species Neisseria meningitidis , processes for obtaining them 
ancT their biological uses. 



5 The invention relates to DNAs and to proteins and 

peptides which are specific to bacteria of the species 
Neisseria meningitidis (abbreviated below to Nm) , to the 
process for obtaining them and to their biological uses, in 
particular for the prevention and detection of meningococcal 
10 infections and meningitis. 

It is known that Nm is one of the main agents of 
cerebrospinal meningitis. 

Studies conducted in the United States have shown that 5 
to 1 0% of the population are asymptomatic carriers of the Nm 
15 strain (s). The transmission factors of Nm are poorly known. 
For a proportion of persons infected, Nm penetrates the 
bloodstream, where it can cause meningococcaemia and/or 
progress to the cerebrospinal stream, to cause meningitis. 
Without fast antibiotic treatment, the infection can develop 
20 like lightning and become fatal. 

Compared with other pathogens, Nm has the characteristic 
of being able to cross the haemato-encephalic barrier to 
colonize the meninges. The study of the pathogenicity of Nm is 
therefore important not only in the context of meningitis, but 
25 also in the context of any disease which affects the brain. 

The benefit of having available tools specific to this 
species of bacteria for the uses envisaged above is therefore 
understood. 

Genetically, Nm is very close to bacteria of the species 
30 Neisseria gonorrhoeae (abbreviated to Ng below) and of the 
species Neisseria lactamica (abbreviated to Nl below) . 
However, their pathogenicity is very different. 
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Nm colonizes the nasopharynx, and then crosses the 
pharyngeal epithelium to invade the submucous space, thus 
being responsible for septicaemia and meningitis. 

Ng is especially responsible for infections located in 
5 the genitourinary tract. It colonizes the genital mucosa, and 
then crosses the epithelium, subsequently invading the 
subepithelium, where it multiplies and is responsible for a 
severe inflammatory reaction. Disseminated gonococcal 
infections are possible, but remain rare and are the result of 
10 only some strains. 

As regards Nl , it is considered that this is a non-pathogenic 
strain, since it is not responsible for a localized or general 
invasion . 

A first consideration thus led to taking into account the 
15 fact that Nm and Ng, while being bacteria very close to one 

another, have different pathogenic potencies. 

Since the genome of these bacteria has a high homology, 

only limited parts of the genome of Nm and Ng must code for 

specific virulence factors responsible for their pathogenesis. 
20 It is clear that Nm has, compared with Ng, DNA sequences 

which are specific to it and which must be involved in the 

expression of its specific pathogenic potency. 

The species Nm is subdivided into serogroups based on the 

nature of the capsular polysaccharides. 
25 At least 13 serogroups have been defined, among which 

serogroups A, B and C are responsible for about 90% of 

meningitis cases . Groups A and C are found in epidemic forms 

of the disease. Group B is the serogroup generally isolated 

the most in Europe and the United States. 
30 The capsule , which is present in Nm and absent from Ng, 

has served as the basis for formulating meningococcal 

antimeningitis vaccines. 
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The polysaccharides of the Nm capsule have been used to 
formulate a vaccine which has proved to be effective in 
preventing in adults the meningitis caused by meningococci of 
serogroups A, C, W135 and Y. 
5 However, the polysaccharide of Nm group C has proved to 

be weakly immunogenic in children of less than two years, 
while the polysaccharide of Nm group B is non-immunogenic in 
man and shares epitopes with adhesion glycoproteins present in 
human neuronal cells. 
10 There is therefore no universal vaccine capable of 

preventing infections caused by all the serogroups of the 
meningococci and capable of responding to the intrinsic 
antigenic variability of bacterial pathogens in general and Nm 
in particular. 

15 Because of the cross-reactivity of the Nm group B 

polysaccharide with human antigens, the multiplicity of the 
serogroups and the antigenic variability of Nm, the strategies 
proposed to date cannot lead to a vaccine which is effective 
in all situations. 

20 Research is therefore concentrated on study of the 

characteristic elements responsible for the specificity of the 
meningococcal pathogenesis* 

The majority of genes which have been studied in either 
of the two bacteria Nm or Ng have their homologue in the 

25 second bacterium. 

In the same way, the majority of virulence factors 
identified to date in Nm have a counterpart in Ng, that is to 
say pilin, the Pile proteins, the opacity proteins and the 
receptors of lactoferrin and transferrin. 

30 The specific attributes of meningococci characterized in 

the prior art are the capsule, the Frp proteins analogous to 
RTX toxins, Opc proteins of the external member, glutathione 
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peroxidase, the porin PorA and the rotamase gene. 

Among these, only the capsule is invariably present in 
the virulent strains of Nm. However, several extracellular 
pathogens have a capsule without nevertheless crossing the 
5 haemato-encephalic barrier. 

Attributes which have not yet been identified must 
therefore be responsible for the specificity of the 
meningococcal pathogenesis. These attributes are probably 
coded by DNA sequences present among the meningococci but 
10 absent from the gonococci. 

The inventors have developed a new approach based on 
subtractive isolation of Nm-specific genes, which genes must 
be linked to the specific pathogenesis of Nm, and more 
particularly to crossing of the haemat o-encephaiic barrier. 
1 5 The subtractive method developed in the prior art has 

resulted in the production of epidemological [sic] markers for 
some Nm isolates. These markers are of limited use: they do 
not cover all the serogroups of the Nm species. 

In contrast to these studies, the work of the inventors 
20 has led, by confronting Nm with the entire Ng chromosome 
sheared in a random manner, to the development of a means for 
cloning all the DNAs present in Nm and absent from Ng, thus 
providing tools of high specificity with respect to Nm, and 
thus enabling the genetic variability of the species to be 
25 responded to for the first time. 

The terms "present" and absent" used in the description 
and claims in relation to the DNAs of a strain or their 
expression products are interpreted on the basis of identical 
hybridization conditions (16 h at 65^C, with NaP0 4 0.5 M, pH 
30 7.2; EDTA-Na 0.001 M, 1%, 1% bovine serum albumin and 7% 
sodium dodecylsulphate) using the same probe and the same 
labelling intensity of the probe, the same amount of 
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chromosomal DNA and the same comparison element (chromosomal 
DNA of the homologous strain) . 

It is therefore considered that the DNA is present if the 
signal obtained with the probe is practically the same as that 
5 obtained with the reference strain. 

Conversely, it is considered that the DNA is absent if 
this signal appears very weak. 

A second consideration of the pathogenicities of Nm and 
Ng leads to taking into account their common capacity for 

10 colonization and penetration of the mucosa, and then invasion 
of the subepithelial space of the latter. It is highly 

( probable that this process involves virulence factors common 

to the two pathogens. In this respect, it is known that a 
certain number of virulence factors have already been 

15 identified in Nm and in Ng, such as the pili proteins, Pile, 
the opacity proteins, the IgA proteases, the proteins for 
binding to transferrin and to lactoferrin, and the 
lipooligosaccharides . 

The approach of the inventors is thus extended to 

20 investigation of the Nm regions which are specific to Nm and 
Ng but absent from the non-pathogenic species Nl, and in a 
general manner to investigation of the chromosomal regions of 
the DNAs and their expression products specific to a given 
species by the means developed in accordance with the 

25 invention. 

The object of the invention is thus to provide DNAs of Nm 
specific to its pathogenic potency and means for obtaining 
them, in particular by formulating banks formed exclusively 
from these Nm-specific DNAs. 
30 It also provides the products derived from these DNA 

sequences . 

The invention also relates to the utilization of specific 
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and exhaustive characteristics of these banks to formulate 
tools which can be used, in particular, in diagnostics, 
treatment and prevention. 

The DNAs of the invention are characterized in that they 

5 are in all or part genes, with their reading frame, present in 
Neisseria meningitidis , but absent both from Neisseria 
gonorrhoeae and from A 7 ersseria lactamica , with the exception 
of genes involved in the biosynthesis of the polysaccharide 
capsule, frpA, frpC, opc, por A, rotamase, the sequence 

0 IS1106, IgA proteases, pilin, pile, proteins which bind 
transferrin and opacity proteins. 

As stated above, the terms "present" and "absent" are 
interpreted on the basis of the hybridization conditions used 
in the Southern blotting described in the examples and 

5 referred to above. 

It can be seen that these DNAs include variants where 
these express a function intrinsic to the Nm species , more 
particularly a phenotype found solely in Nm or in common 
exclusively with Ng. 

0 According to a main aspect, these DNAs are specific to 

the pathogenicity of Neisseria meningitidis , in spite of the 
genetic variability of this species. 

According to an embodiment of the invention, the said 
DNAs are specific to Nm, in contrast to Ng. 

5 More particularly, the Nm- specific DNAs are absent from 

Neisseria lactamica and from Neisseria cinerea. 

Surprisingly, the majority of genetic differences between 
the strains of meningococci and those of gonococci appear 
grouped in distinct regions, which are said to correspond to 

0 the pathogenicity islets described previously for E . coll and 
Y. pestis. 

In a preferred embodiment of the invention, these DNA are 
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thus also characterized in that they comprise one or more 
sequence (s) present on the chromosome of Neisseria 
meningitidis Z2491 between tufh and pilT, or region 1 of the 
chromosome, and/or the sequence (s) capable of hybridizing with 
5 the above sequence (s) , with the proviso of being specific to 
Neisseria meningitidis . 

"Specific" in the description and the claims means the 
nucleotide sequences which hybridize only with those of Nm 
under the hybridization conditions given in the examples and 
10 referred to above. 

In this respect, it can be seen that, in a general 
manner, when n all or part" of a sequence is referred to in the 
description and claims, this expression must be interpreted 
with respect to the specificity defined above. 
15 Furthermore, all or part of a peptide or a fragment of a 

peptide or an antibody means a product having the biological 
properties respectively of the natural peptide or the antibody 
formed against the peptide. 

Genes of the Neisseria meningitidis capsule are grouped 
20 in region 1 . 

DNAs of this type have a sequence corresponding in all or 
part to SEQ ID No. 9, 13, 22 or 30, and/or to any sequence 
located at more or less 20 kb from these SEQ ID on the 
chromosome of an Nm strain, and/or have a sequence which is 
25 capable of hybridizing with at least a fragment of any one of 
these sequences. 

In another preferred embodiment of the invention, these 
DNA are also characterized in that they are made up of one or 
more sequence (s) present on the chromosome of Neisseria 
meningitidis Z2491 between pilQ and A.740, or region 2 of the 
chromosome, and/or the sequences (s) capable of hybridizing 
with the above sequence (s) , with the proviso of being specific 
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to Neisseria meningitidis . 

DNAs according to this embodiment have a sequence 
corresponding in all or part to SEQ ID No. 1, 2, 4, 6, 7, 10, 
15, 31 or 34 , and/or to any sequence located at more or less 

5 20 kb from these SEQ ID on the chromosome of an Nm strain, 
and/or have a sequence which is capable of hybridizing with at 
least a fragment of any one of these sequences. 

The invention especially provides all or part of the DNA 
sequence SEQ ID No. 36 of 1 5,620 bp, and the sequences 

0 corresponding to the open reading frames SEQ ID No. 37, SEQ ID 
No. 38, SEQ ID No. 39, SEQ ID No. 40, SEQ ID No. 41, SEQ ID 
No. 42, SEQ ID No. 43, SEQ ID No. 44 and SEQ ID No. 45. 

In yet another preferred embodiment of the invention, 
these DNAs are also characterized in that they are made up of 

5 one or more sequence (s) present on the chromosome of Neisseria 
meningitidis Z2491 between argF and opaB, or region 3 of the 
chromosome, and/or the sequence (s) capable of hybridizing with 
the above sequence (s) , with the proviso of being specific to 
Neisseria meningitidis . 

0 DNAs according to this embodiment are characterized in 

that they have a sequence corresponding in all or part to SEQ 
ID No. 8, 21 , 23, 25, 26, 28, 29, 32 or 35, and/or to any 
sequence located at more or less 20 kb from these SEQ ID on 
the chromosome of an Nm strain, and/or have a sequence which 

5 is capable of hybridizing with at least a fragment of any one 
of these sequences. 

Regions 1 , 2 and 3 identified above have a high 
proportion of sequences specific to Neisseria meningitidis and 
also fall within the context of the invention. 

0 Other DNAs representative of the specificity with respect 

to Neisseria meningitidis have one or more sequences which 
is/are present on the chromosome of Neisseria meningitidis 
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Z2491 but are not part of regions 1, 2 and 3 defined above. 

Such DNAs comprise one or more sequence (s) corresponding 
in all or part to SEQ ID No. 3, 5, 11, 12, 14, 16, 18, 19, 20, 
24, 27 or 33, and/or to any sequence located at more or less 
20 kb from these SEQ ID on the chromosome of an Nm strain, 
and/or have a sequence capable of hybridizing with such 
sequences . 

Taking into account the uses envisaged in particular, the 
invention more specifically relates to the above DNAs involved 
in the pathogenesis of the bacterial organism. 

In particular, it provides the DNAs corresponding to at 
^east one of the characterizations given above and coding for 
a protein exported beyond the cytoplasmic membrane, and/or of 
wftich all or part of their sequence corresponds to the 
conserved region of the said DNAs. 

According to another embodiment of the invention, the 
DNAs are thus common with those of Ng, but are absent from 
those of Nl. 

These are more specifically the DNAs which are present on 
region 4 (arg J to reg F) or on region 5 (lambda 375 marker to 
pen A) on the chromosome of Nm Z2491 and/or are capable of 
hybridizing with the said DNAs present, with the proviso of 
being specific to Nm and Ng, in contrast to Nl. 

"Specific to Nm and Ng in contrast to Nl" means the DNAs 
25 which hybridize with the DNAs of Nm and Ng under the 
hybridization conditions of the examples (see example 4 in 
particular) . 

The DNAs of regions 4 and 5 and those capable of 
hybridizing with these DNAs , with the proviso of expressing 
the intrinsic functions of Nm, have the advantage of 
intervening in a significant manner in the virulence of Nm, 
being involved in the stage of initial colonization and 
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penetration and in the septicaemic dissemination. 

According to other embodiments, the invention provides 
transfer and expression vectors, such as plasmids, cosmids or 
bacteriophages, comprising at least one DNA as defined above. 

It also provides host cells transformed by at least one 
DNA as defined above. 

Other host cells of the invention comprise genes or gene 
fragments specific to Nm, and are characterized in that their 
chromosome is deleted by at least one DNA according to the 
invention, in particular a DNA responsible for the 
pathogenicity. They are more specifically bacterial cells, in 
particular of Nm. 

The invention also relates to the RNAs of which the 
sequence corresponds in ail or part to the transcription of at 
least one DNA sequence or sequence fragment as defined above. 

The invention also relates to the antisense nucleic acids 
of the DNAs as defined above, or of fragments of these DNAs. 

These antisense nucleic acids carry, where appropriate, 
at least one substituent, such as a methyl group and/or a 
glycosyl group . 

Other products which fall within the context of the 
invention include polypeptides. 

These polypeptides are characterized in that they have an 
amino acid chain corresponding to all or part of a sequence 
coded by the nucleic acids defined above, or deduced from 
sequences of these nucleic acids. 

They are advantageously polypeptides corresponding to all 
or part of the polypeptides exported beyond the cytoplasmic 
membrane, more specifically polypeptides corresponding to all 
or part of those coded by a conserved region. 

As a variant, the polypeptides of the invention can be 
modified with respect to those corresponding to the nucleic 
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acid sequences such that they are particularly suitable for a 
given use, in particular use as a vaccine. 

Modification is understood as meaning any alteration, 
deletion or chemical substitution where this does not affect 
5 the biochemical properties of the corresponding natural 
polypeptides, more specifically of functional proteins 
exported at the periplasm and the external membrane. 

Other products according to the invention include 
antibodies directed against the above polypeptides. 
10 The invention thus provides polyclonal antibodies, and 

also monoclonal antibodies, characterized in that they 
recognize at least one epitope of a polypeptide as described 
above . 

It also relates to fragments of these antibodies, more 
15 particularly the fragments Fv, Fab and Fab ' 2 . 

The invention also relates to the anti-antibodies which 
are capable of recognizing the antibodies defined above, or 
their fragments, by a reaction of the antigen-antibody type. 

According to the invention, the various products 
20 considered above are obtained by a synthesis and/or biological 
route in accordance with conventional techniques. 

The nucleic acids can also be obtained from banks made up 
of Nm-specific DNAs such as are formulated by a subtractive 
technique, this technique comprising: 
25 - mixing of two DNA populations, 

- realization of at least one subtractive hybridization- 
amplification iteration, and 

- collection of the desired DNA or DNAs, followed, where 
appropriate, by its/their purification with elimination of 

30 redundant sequences. 

According to the invention, the two DNA populations 
originate respectively from a strain of Neisseria 
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meningitidis , the so-called reference strain for which the 
specific bank must be constructed, and a strain of Neisseria, 
the so-called subtraction strain, having a homology in primary 
DNA sequences of greater than about 70% with the Neisseria 
5 meningitidis strain, the DNA sequences of the subtraction and 
reference strains being obtained respectively by random 
shearing, and by cleavage by a restriction endonuclease 
capable of producing fragments less than about 1 kb in size. 

The invention provides in particular a process for 
10 obtaining Neisseria meningitidis-speclfic DNA banks, 
comprising the stages of 

- random shearing of the chromosomal DNA of a strain of 
Neisseria gonorrhoeae , the so-called subtraction strain, in 
particular by repeated passage through a syringe, 

15 - cleavage of the chromosomal DNA of a strain of 

Neisseria meningitidis , the so-called reference strain, 
preferably by a restriction enzyme producing fragments less 
than about 1 kb in size, 

- splicing of the DNA fragments of the reference strain, 
20 cleaved by the restriction enzyme, with suitable 

oligonucleotide primers, 

realization of a subtractive hybridization- 
amplification iteration, by: 

mixing of the two DNA populations under suitable 
25 conditions for hybridization of homologous sequences, and then 
amplification of auto-reannealed fragments and 
collection of these fragments, 

. digestion of these fragments by a restriction enzyme 
and re-splicing with oligonucleotide primers, followed by a 
30 - purification of the spliced DNA and, where appropriate, 

a new iteration of the subtractive hybridization, comprising 
mixing of DNA fragments of Neisseria gonorrhoeae sheared as 
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indicated above with DNA fragments of Neisseria meningitidis 
produced by the preceding iteration, followed, if desired, by 
cloning of the DNAs of the bank. 

The primers used are oligodeoxynucleotide primers which 
are suitable for the restriction endonuclease used and allow 
insertion into a cloning site, such as the EcoRI site of the 
plasmid pBluescript. Such primers will advantageously be 
chosen among the oligodeoxynucleotides referred to in the 
sequence listing under SEQ ID no. 36 to 45. 

The banks thus obtained are formed from DNAs which are 
specific to meningococci and absent from gonococci. 

The specificity of the DNAs was verified, as described in 
the examples, at each iteration by Southern blots, with genes 
common to the subtraction strain and to the reference strain, 
or with the total DNA of each of the strains digested by a 
restriction endonuclease, such as Clal. 

At each iteration, the exhaustivity of the DNA bank was 
also verified by Southern blotting with probes known to be 
specific to the reference strain, that is to say for Neisseria 
meningitidis the frp, opc and rotamase genes in particular. 

The experiments carried out showed that the banks 
obtained by the process of the invention are deficient in 
genes having a significant homology with species of Neisseria 
other than Neisseria meningitidis , for example the ppk or 
pilC1 genes, generally in only 2 or 3 iterations. 

If necessary, two routes, which are not exclusive of each 
other, can be taken. 

It is possible to proceed with an (n+1 ) th iteration using 
the DNA of iteration n as the DNA population of the reference 
strain. 

As a variant, a second bank independent of the first is 
constructed, with a restriction enzyme of different 
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specificity to that used in the first bank, for example Mbol . 

In all cases, it is preferable to keep each of the 
products produced by each of the iterations performed. 

The invention also provides the use of the subtractive 
technique described above to obtain banks of the DNAs common 
to Nm and Ng, but specific with respect to Nl . 

Three different banks are advantageously constructed, two 
of them by digestion of the chromosomal DNA of Nm by MboX and 
Tsp5091 , and the third by digestion of the chromosomal DNA of 
Nm with Mspl. Two subtraction series allow the DNAs having the 
required specificity to be collected, as described in the 
examples . 

The invention also relates to the process for obtaining 
these banks and the banks themselves. 

It can be seen that, generally, the process of the 
invention can be used to obtain banks of DNAs specific to a 
given cell species, or to a given variant of the same species, 
where another species or another variant which is close 
genomically and expresses different pathogenic potencies 
exists . 

Using the process of the invention, DNA banks specific to 
given species of cryptococci, Haemophilus, pneumococci or also 
Escherichia coli, or more generally any bacterial agent 
belonging to the same species and having different pathovars 
will advantageously be constructed. 

Furthermore, from these banks the invention provides the 
means to have available virulence factors specific to a 
species or a given variant. 

Such banks are therefore tools which are of great 
interest for having available attributes which are responsible 
for the specificity of a pathogen, this use being more 
specifically illustrated according to the invention by the 
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obtaining of banks comprising the attributes responsible for 
the specificity of the meningococcal pathogenesis. 

Study of the products of the invention, the nucleic 
acids, polypeptides and antibodies, has enabled an absolute 
specificity with respect ot Neisseria meningitidis, regardless 
of the strain and its variability, to be demonstrated. 

These products are therefore particularly suitable for 
diagnosis or prevention of infections and meningitis caused by 
Neisseria meningitidis, whether in adults or children and 
regardless of the serogroups of the strain in question. 

The method for diagnosis, according to the invention, of 
a meningococcal infection, and more particularly of 
meningococcal meningitis, by demonstration of the presence of 
Neisseria meningitis in an analytical sample is characterized 
by the stages of: 

- bringing into contact a sample to be analysed, that is 
to say a biological sample or a cell culture, and a reagent 
formulated from at least one nucleic acid as defined above, if 
appropriate in the form of a nucleotide probe or a primer, or, 
as a variant, from at least one antibody or a fragment of an 
antibody as defined above, under conditions which allow, 
respectively, hybridization or a reaction of the antigen- 
antibody type, and 

- detection of any reaction product formed. 

If the reagent is formulated from a nucleic acid, this 
can be in the form of a nucleotide probe in which the nucleic 
acid or a fragment of the latter is labelled in order to 
enable it to be detected. Suitable markers include 
radioactive, fluorescent, enzymatic or luminescent markers. 

As a variant, the nucleic acid is included in a host 
cell, which is used as the reagent. 

In these various forms, the nucleic acid is used as such 
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or in the form of a composition with inert vehicles. 

If the reagent is compiled from an antibody, or a 
fragment of an antibody, this can be labelled for detection 
purposes. Most generally, a fluorescent, enzymatic, 
radioactive or luminescent marker is used. 

The antibody or the antibody fragment used, which is 
labelled if appropriate, can be used as such or in the form of 
a composition with inert vehicles. 

The sample used in the stage of bringing the components 
into contact is a biological sample produced by a mammal, such 
as cephalorachidian fluid, urine, blood or saliva. 

The detection stage is carried out under conditions which 
allow the reaction product to be demonstrated when it is 
formed. Conventional means use fluorescence, luminescence, 
colour or radioactive reactions, or also autoriadography [sic] 
techniques. It is also possible to quantify the product. 

The invention also relates to the labelled products, the 
nucleic acids and antibodies, as new products. 

The method defined above can be used for diagnosis of an 
immune reaction specific to a meningococcal infection. 

The reagent used is thus a polypeptide according to the 
invention, as coded by the said nucleic acid sequences, 
corresponding to the natural product or a polypeptide which is 
modified but has the biological and immunological activity of 
the corresponding natural polypeptide. 

It is advantageously a polypeptide exported beyond the 
cytoplasmic membrane of Neisseria meningitidis, more 
particularly the part of such a polypeptide corresponding to 
the conserved region of the DNA. 

The invention also relates to kits for carrying out the 
methods defined above. These kits are characterized in that 
they comprise: 
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- at least one reagent as defined above, that is to say 
of the nucleic acid, antibody or polypeptide type, 

- products, in particular markers or buffers, which 
enable the intended nucleotide hybridization reaction or 

5 immunological reaction to be carried out, as well as use 
instructions . 

The specificity of the products of the invention and 
their location on the chromosome of Neisseria meningitidis 
Z2491 , either grouped in a region and able to be interpreted 
10 as pathogenicity islets, or isolated on the chromosome, impart 
to them a very particular interest for realization of vaccine 
^ compositions with a universal purpose, that is to say whatever 

the strain and the variability which it expresses. These 
iu compositions can include in their spectrum other prophylaxes, 

dp 15 an <3 can be, for example, combined with childhood vaccines. 
m The invention thus provides vaccine compositions which 

w include in their spectrum antimeningococcal prophylaxis, 

O intended for prevention of any infection which may be caused 

f|| by Neisseria meningitidis, these compositions being 

JJJ 20 characterized in that they comprise, in combination with (a) 
C physiologically acceptable vehicle (s), an effective amount of 

polypeptides or anti-antibodies or their fragments as defined 

m 

^ above, these products optionally being conjugated, in order to 

reinforce their immogenicity [sic] . 
25 Immunogenic molecules which can be used comprise the 

poliovirus protein, the tetanus toxin, or also the protein 
produced by the hypervariable region of a pilin. 

As a variant, the vaccine compositions according to the 
invention are characterized in that they comprise, in 
30 combination with (a) physiologically acceptable vehicle (s), an 
effective amount: 

- of nucleic acids as defined above, 
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- of transformed host cells as defined above, or 

- of Nm cells, the chromosome of which has been deleted 
by at least one DNA sequence according to the invention 
involved in the pathogenicity of the bacterium. The nucleotide 
material used is advantageously placed under the control of a 
promoter of its expression in vivo and synthesis of the 
corresponding protein. To reinforce the immunogenicity , it is 
also possible to combine this nucleic material with a DNA or 
an RNA which codes for a carrier molecule, such as the 
poliovirus protein, tetanus toxin or a protein produced by the 
hypervariable region of a pilin. 

The vaccine compositions of the inventions can be 
administered parenterally , subcutaneously , intramuscularly or 
also in the form of a spray. 

Other characteristics and advantages of the invention are 
given in the examples which follow for illustration thereof, 
but without limiting its scope. 

In these examples, reference will be made to figures 1 to 
1 1 , which show, respectively, 

- figures 1A, 1B, 1C, 1D, 1E, 1F and 1G: analysis of the 
subtractive bank Tsp5091 , 

- figure 2: the distribution of the Nm-specific sequences, in 
contrast to Ng, on the chromosome of the strain Z2491 (left- 
hand part) and of Nm-specific sequences, in contrast to Nl 
(right-hand part) , 

- figure 3A to 3C: the reactivity of the clones of the 3 
regions of the chromosome according to the invention towards a 
panel of strains of the genus Neisseria, 

- figure 4: the position in region 2 of the chromosome of Nm 
of oligonucleotides used as probes, 

- figures 5, 6 and 7: the Southern blots of a panel of strains 
of the genus Neisseria, using parts of region 2 of Nm as 
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probes , 

- figures 8A to 8C; the Southern blots with 3 subtractive 
banks over a panel of 12 strains of Neisseria, and 

- figures 9, 10 and 11: the reactivity of clones of the 3 
subtractive banks with respect to Nm, Nl and Ng. 

In the examples which follow, the following materials and 
methods were used: 

Bacterial strains - To obtain the subtractive banks, strain 
Z2491 of Nm (Achtman et al., 1991, J. Infect. Dis. 164, 375- 
382), the strains MS1 1 (Swanson et al . , 1 974, Infect. Immun. 
10, 633-644) and the strains 8064 and 9764 of Nl were used, it 
being understood that any other strain of the species in 
question could be used. 

In order to verify the specificity of these banks, 6 
strains of Nm, 4 strains of Ng, one strain of Nl {Neisseria 
lac t arnica) and one strain of Nc (Neisseria cinerea) were used. 

The six strains of Nm are: Nm Z2491 of serogroup A, Nm 
8013 of serogroup C (XN collection), Nm 1121, no serogrouping 
possible (XN collection), Nm 1912 serogroup A (XN collection), 
Nm 7972 of serogroup A (XN collection) and Nm 8216 of 
serogroup B (XN collection) . 

The four strains of Ng are: Ng MS1 1 (Pasteur Institute, 
Paris), Ng 403 (Pasteur Institute, Paris), Ng 6934 (Pasteur 
Institute, Paris), Ng WI (isolated from a disseminated 
gonococcal infection), Ng 4C1, Ng 6493 and Ng FA 1090. 

The strains of Nl are Nl 8064 and Nl 9764 (XN 
collection), and that of Nc is Nc 32165 (XN collection). 

Molecular genetics techniques 

Unless indicated otherwise, the techniques and reagents 
used correspond to those recommended by Sambrook et al 
(Sambrook et al 1989, Molecular Cloning: A Laboratory Manual. 
Cold Spring Harbor Laboratory Press) . The 
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oligodeoxynucleotides used in this study are: 
RBAm12, 3 1 AGTGGCTCCTAG 54 (SEQ ID No. 54) 

RBam24, 5* AGCACTCTCCAGCCTCTCACCGAG 3'; (SEQ IN No. 55) 
Jbam12, 3* GAT C C GT T CAT G 5'; (SEQ ID No. 60) 
JBAM24, 5' ACCGACGTCGACTATCCATGAACG 3'; (SEQ ID No. 61) 
REco12, AGTGGCTCTTAA; (SEQ ID No. 56) 

REco24, 5' AGCACTCTCCAGCCTCTCACCGAG 3 ! ; ( = RBam 24) 
JEco12, GTACTTGCTTAA; (SEQ ID No. 62) 
JEco24, 5' ACCGACGTCGACTATCCATGAACG 3'; (= JBam24) 
NEco12, AATTCTCCCTCG; (SEQ ID No. 64) 
NEco24, AGGCAACTGTGCTATCCGAGGGAG; (SEQ ID No. 65). 
Transfer to membranes (Southern blots) 

The transfers to membranes were effected by capillary 
transfers to positively charged nylon membranes (Boehringer 
Mannheim) . The hybridizations were carried out at 65^c in a 
solution comprising NaPi [sic] 0.5 M pH 7.2/EDTA 1 mM/SDS 
7%/BSA 1%. The membranes were washed in a solution comprising 
NaPi [sic] 40 mM pH 7.2/EDTA 1 mM/SDS 1%. The final washing 
was carried out at 65^C for 5 min. 

The probe frp obtained with oligonucleotides based on the 
frpA sequence corresponds to 2.4 kb of the 5 f end of the gene 
of the strain Z2491 . The opc and rotamase probes corresponding 
to whole genes are produced from the strain Z2491 using 
oligonucleotides constructed on the basis of published 
sequences. The probes pilCl and ppk (polyphosphate kinase) 
correspond to inserts of the plasmids pJL1 and pBluePPK6001 
respectively. 

Example — 1_: — Construction of banks of DNAs present in Nm and 
absent from Ng. 
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a. "Mbol" bank 

Construction - The DNA of Nm Z2491 was cleaved by the 
endonuclease Mbol and subjected to two iterations of a method 
called CDA (comprehensive difference analysis) below. This 
method comprises subtractive hybridization in the presence of 
excess sheared DNA of Ng MS11 and amplification by PCR of 
those meningococcal sequences which, since they are absent 
from or do not have significant homology with the DNA of Ng 
MSI 1 , could reanneal 

The chromosomal DNA of the strain Ng MS1 1 is sheared 
randomly by repeated passage through a hypodermic syringe 
until fragments of a size ranging from 3 to 10 kb are 
obtained. These DNA fragments are purified by extraction with 
phenol . 

The chromosomal DNA of the strain Nm Z2491 is itself 
cleaved by the restriction endonuclease Mbol. These DNA 
fragments (20 lag) are spliced with 10 nmol of annealed 
oligonucleotides RBaml 2 and RBam24. The excess primers are 
removed by electrophoresis over 2% agarose gel of low melting 
point. The part of the gel containing amplified fragments 
greater than 200 bp in size is excised and digested by (3- 
agarase. These fragments are purified by extraction with 
phenol . 

To carry out a subtractive hybridization (first 
iteration), 0.2 [xg of the Nm DNA spliced with the RBam 
oligonucleotides is mixed with 40 [ig Ng DNA in a total volume 
of 8 ml of a buffer EE 3X (a buffer EE 1X is composed of N-(2- 
hydroxyethyl)piperazine-N' - (3-propanesulphonic acid) 10 mM and 
EDTA 1 mM, and its pH is 8.0). This solution is covered with 
mineral oil and the DNA is denatured by heating at 100^c for 2 
min. 2 y.1 NaCl 5 M are added and the mixture is left to 
hybridize at 552C for 48 h. The reaction mixture is diluted to 
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1/10 in a preheated solution composed of NaCl and buffer EE, 
and in then immediately placed on ice. 

10 [il of this dilution are added to 400 ^1 of PCR 
reaction mixture (Tris.HCl pH 9.0 10 mM; KC1 50 mM; MgCl 2 1.5 
mM; Triton X100 0.1%; 0.25 mM of each of the four triphosphate 
deoxynucleotides; Taq polymerase 50 units per ml) . The mixture 
is incubated for 3 mm at 70^C to complete the ends of the 
reannealed meningococcal DNA fragments. 

After denaturing at 94^C for 5 min and addition of the 
oligonucleotide RBam24 in an amount of 0.1 nmol per 100 ul , 
the hybridizations are amplified by PCR (30 cycles of 1 min at 
942c, 1 min at 70^C and 3 min at 12^C f followed by 1 min at 
94^C and 10 min at 72^C; Perkin-Elmer GeneAmp 9600). 

The amplified meningococcal fragments are separated from 
the primers and high molecular weight gonococcal DNAs on gel. 
They are digested by Mbol and the oligonucleotides JBaml 2 and 
J3am 24 are spliced to them again. These spliced DNAs are 
again purified over gel and extracted with phenol. 

A second iteration of the subtractive hybridization is 
carried out on 4 0 ug of the randomly sheared Ng DNA and 25 ng 
of the DNA spliced with the JBam oligonucleotides obtained 
from the first iteration of the subtractive hybridization. 
During this second iteration, amplification of the auto- 
annealed Nm DNA is effected with the aid of the 
oligonucleotide JBam24 . 

Specificity - In order to confirm their Nm specificity, 
the amplified sequences after the second iteration of the CDA 
method are labelled and used as a probe for the DNA digested 
by Clal produced from a panel of six strains of Neisseria 
meningitidis, four of Neisseria gonorrhoeae, one of Neisseria 
lactamica and one of Neisseria cinerea. 

The Southern blots obtained show that the amplified 
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sequences resulting from the second iteration of the CDA 
method have a high reactivity with several bands corresponding 
to meningococci, and do not have a reactivity with the bands 
corresponding to the Ng, Nl and Nc strains. 

The "Mbol" bank thus appears to be Nm-specific. 

Exhaustivity - In order to test the exhaustivity of the 
bank, all the products produced from the first and second 
iterations of the CDA method and also the initial chromosomal 
materials of Nm Z2481 [sic] and Ng MS11 are subjected to 
agarose gel electrophoresis, transferred to a membrane and 
brought into contact with probes comprising genes known to be 
meningococcus-specific, that is to say frp, opc and rotamase 
(Southern blotting) . 

As a result of these hybridizations, the Nm-specific gene 
frp is represented in the Mbol bank by a fragment of 600 bp, 
but no activity is observed for the rotamase and opc genes. 
The Mbol bank, although Nm-specific, therefore cannot be 
considered exhaustive. 

Given their high specificity, the fragments produced by 
the second iteration of the CDA method for the Mbol bank can 
nevertheless be cloned on the BamHI site of the plasmid 
pBluescript . 

A sequence corresponding to any of the Nm-specific genes 
can be included in the subtractive bank only if it is carried 
by a restriction fragment of appropriate size. This condition 
is a function of two factors. Firstly, the probability that 
the largest fragments are entirely Nm-specific is low. 
Secondly, even if such fragments existed, they would be under- 
represented in the bank because of the limitations of the PCR 
technique, the amplification effectiveness of which decreases 
with increasing size of the fragments. Fragments greater that 
about 600 bp in size are not included in the bank. Because of 
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the absence of Mbo fragments of suitable size from the 
chromosome of Nm Z2491 , the rotamase and opc genes cannot be 
included in the bank. Any enzyme cannot by itself produce a 
small fragment corresponding to any Nm-specific gene. A second 
bank was therefore constructed using another restriction 
enzyme with a different specificity: Tsp509 [sic] . 

b. "Tsp5091" bank 

Construction - The enzyme Tsp5091 has the advantage of 
producing fragments of smaller size (less than about 1 kb) 
than the enzyme Mbol . 

Tsp509l recognizes the sequence AATT and leaves, 
projecting at 5', a sequence of 4 bases compatible with £coRI . 
The oligonucleotides used are Reco, Jeco and NEco . 

The method followed conforms with that followed for 
construction of the "Mbol" bank described above. However, 
higher quantities of meningococcal DNA were used for the first 
iteration of the subtractive hybridization in order to 
compensate for the higher number of fragments of low molecular 
weight produced by Tsp509l. For the first iteration, 400 ng Nm 
DNA fragments and, in the second, 25 ng Nm fragments are 
subjected to subtractive hybridization with 40 jag randomly 
sheared Ng DNA. 

For the construction of this "Tsp509l" bank, as a 
control, a third iteration of the subtractive hybridization is 
carried out using 40 (ig sheared Ng DNA and 0.2 ng Nm fragments 
resulting from a digestion by Tsp509l and a resplicing, with 
NEco adaptors, of the fragments obtained as a result of the 
second iteration. 

Specificity - As described for the previous bank, the 
product resulting from the second iteration of the CDA method 
is labelled and used as the probe for a panel of strains of 
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Neisseria . 

Figure 1A illustrates the Southern blot hybridization of 
products of the second iteration of the CDA method with the 
DNA digested by Clal of: Nm in track a, Ng MS1 1 in track b, Nm 
8013 in track c, Ng 403 in track d, Nm 1121 in track e, Ng 
6934 in track f, Nm 1912 in track g, Ng wi (strain DGI) in 
track h, Nm 7972 in track i, Nl 8064 in track j, Nc 321 65 in 
track k, Nm 8216 in track 1. 

In contrast to the high reactivity observed with all the 
Nm strains, a low or no reactivity is observed with the Ng, Nl 
and Nc strains. 

The specificity of the bank was studied earlier by 
reacting membrane transfers (Southern blots) of the products 
produced by each of the three iterations of the CDA method 
with probes corresponding to pilC1 and ppk. These two genes 
are common to Nm and Ng. 

Figure 1B shows an agarose gel after electrophoresis of 
the chromosomes of Nm Z2491 and Ng Ms11, digested by Tsp509 
[sic] , and products resulting from each of the iterations of 
the CDA method. 

In track a 1 ng of the chromosome of Nm was deposited, in 
track b 1 jig of that of Ng, in track c 0.15 ^g of the products 
resulting from the first CDA iteration, in track d 0.1 ng of 
those of the second iteration, in track e 0.05 of the third 
iteration, MW representing the molecular size markers. 

Figures 1C and 1D show gels obtained as described in 
figure 1B after transfer to the membrane (Southern blots) and 
hybridization with pild (figure 1C) and ppk (figure 1D). 

At the end of the second iteration of the CDA method, the 
sequences corresponding to the piJC1 and ppk genes are 
completely excluded from the bank. 

Exhaustivity - The exhaustivity of the bank was examined 
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by reacting the products resulting from the subtractive 
hybridization with the probes corresponding to three Nm- 
specific genes (f~rp, rotamase and opc) . 

These Nm-specific probes react with the amplification 
products resulting from the first and second iteration of the 
subtractive hybridization. 

Figures 1E, "IF and 1G show gels obtained as described in 
figure 1B after transfer to the membrane (Southern blots) and 
hybridization with frpA (figure 1E), rotamase (figure 1F) and 
opc (figure 1G) . 

However, a third iteration of the subtractive 
hybridization leads to the loss of Nm-specific sequences, 
since the fragments which react with the rotamase and opc 
genes are absent from this third iteration. 

In consideration of all these data, it emerges that the 
products resulting from the second iteration of the CDA method 
are Nm-specific and also constitute an exhaustive bank of Nm- 
specific sequences. 

The products resulting from this second iteration are 
cloned at the EcoRI site of the plasmid pBluescript. 

The bank produced by Tsp509l is more exhautive [sic] than 
the bank produced by Mbol, as the theory considerations based 
on the enzymatic production of smaller restriction fragments 
would suggest. 

In accordance with this aspect, it should be noted that 
the Tsp509l bank is less redundant than the Mbol bank, that is 
to say it comprises less duplication of clones. 86% of the 
clones of the Tsp509I bank correspond to distinct sequences, 
while only 43% of the clones correspond to distinct sequences 
in the Mbol bank (data not shown) . 

The bank produced by Tsp509l thus constitutes a source of 
Nm-specific clones. 
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Example 2: Analysis of the clones of the subtractive bank 

Cloning and sequencing of the Nm-specific DNAs 

The DNAs of the subtractive banks are clones at the BamHl 
(Mbol bank) or EcoRI (Tsp509l bank) site of the plasmid 
pBluescript, and then transformed in DHScc of E. coll. The 
inserts are amplified by PCR carried out on the transformed 
colonies using the primers M13-50 and M13-40, the latter 
primer being biotinylated on its 5* end. 

Sequencing was carried out on each PCR product after 
separation of the biotinylated and non-biotinylated strands 
using the system of Dynabeads M-280 with streptavidin (Dynal, 
Oslo) . The sequences are screened according to their 
homologies with previously published sequences using the 
computer programs Blastn and Blastx (NCBI, USA and Fasta) . 

The PCR products resulting from the transformed bacteria 
colonies after using the primers M13-40 and M13-50 as 
described above were labelled by incorporation with random 
priming of a- 32 P-dCTP and were used as a probe for the membrane 
transfers of the chromosomal DNA digested by Clal of strains 
Nm Z2491 and Ng MS11 , as described above, in order to verify 
their specificity . 

Mapping of clones on the chromosome of the strain Nm 
Z2491 . 

The results of studies carried out with 17 clones of the 
"Mbol" bank (designated by the letter B) and 16 clones of the 
"Tsp5091 " bank (designated by the letter E) , 'each of these 
clones having a unique sequence and being without counterpart 
in Ng, are reported. 

The positions of the DNA sequences corresponding to 
cloned Nm-specific products were determined with respect to 



REPLACEMENT SHEET (RULE 26) 



28 



the published map of the chromosome of Nm Z2491 (Dempsey et 
al. 1995, J. Bacterid. V77, 6390-6400) and with the aid of 
transfers to membranes (Southern blots) of agarose gel 
subjected to pulsed field electrophoresis (PFGE) . 

The Nm-specific clones are used as probes for a 
hybridization on membranes (Southern blots) of the DNA of Nm 
Z2491 digested with enzymes of rare cutting sites, that is to 
say Pad, Pmel , Sgfl , Bglll , Spel Nhel and Sgfl . 

The gels (20 x 20 cm) were gels of 1% agarose in a buffer 
T3E 0.5X and were subjected to electrophoresis at 6 V/cm for 
36 hours according to pulsation periods varying linearly 
between 5 and 35 seconds. 

The hybridizations on the membrane (Southern blots) were 
carried out as described above. 

The results obtained are shown on figure 2: the 
reactivity was located by comparison with the positions of the 
fragments of corresponding size on the published map. The 
positions of all the genetic markers mapped by Dempsey et al 
(mentioned above) are visualized with the aid of points on the 
linear chromosomal map. The Nm-specific genes disclosed 
previously are labelled with an asterisk. The two loci called 
"frp" correspond to the frpA and frpC genes. The "pile" loci 
correspond to the pild and p±lC2 genes, which are pairs of 
homologous genes and are not distinguished on the map. The 
accuracy of the positions of the Nm-specific clones of the 
invention depends on the overlapping of reactive restriction 
fragments. On average, the position is +/- 20 kb. 

This mapping reveals a non-random distribution of the Nm- 
specific sequences. The majority of the Nm-specific sequences 
belong to three distinct groups. One of these groups (region 
1 ) corresponds to the position of genes relating to the 
capsule which have been described previously. 
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A distinction is made between: 

- El 09, E138, B230 and B323 as being region 1, 

- B322, B220, B108, B132, B233, B328, E139, E145 as B101 
as being region 2, and 

- B306, E114, E115, E124, E146, E120, E107, E1 37 and 142 
as being region 3. 

63% of the sequences identified as specific to meningococci 
are located inside these three distinct regions. 

This grouping contrasts with the distribution of 
previously disclosed Nm-specific genes {frpA, frpC, porA, opc 
and the region relating to the capsule) . 
P This prior art would suggest in fact that the Nm-specific 

genes, with the exception of functional genes relating to the 
* capsule, were dispersed along the chromosome. 

P 15 Mapping of Nm-specific sequences on the chromosome leads 

fjj to an unexpected result with regard to the prior art. 

" majority of the genetic differences between the 

□ meningococcal and gonococcal strains tested are grouped in 

i| three distinct regions. 

S 20 Meningococcal genes relating to the capsule are grouped 

^ in region 1 . 

The function of genes of the other regions is unknown, 
but homologies with published sequences (table 1) suggest 
similarities between certain genes of region 3 and 

25 bacteriophage transposase and regulatory proteins. No 
meningococcal virus has been characterized and it is tempting 
to think that these sequences are of phagic origin. 
Interestingly, the genome of H. influenzae also contains a 
sequence homologous to that of the Ner regulatory protein of 

30 phage Mu, but it is not known if it is a functional gene. 

The clone B208 has a high homology (48% identical, 91% 
homology for 33 amino acids) with a clone of conserved regions 
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(field III) in the class of proteins which bind to TonB- 
dependent ferric siderophors. 

The proximity of this clone with the Nm-specific porA 
genes and the frp genes regulated by iron, and in particular 
the possibility that it is an Nm-specific receptor protein 
exposed on the external membrane in itself is a good candidate 
for further research. 

The clone B339 corresponds to the Nm-specific insertion 
sequence IS 7 1 06 . 

The low homology between the clone B1 34 and the Aeromonas 
insertion sequence and also the presence of multiple copies of 
the clone B1 34 among the various strains of Nm suggest that it 
could be a new type of Nm-specific insertion sequence. 

The possibility that the regions containing the Nm- 
specific clones could correspond to pathogenicity islets as 
described previously for E, coli and Y. pestis is of 
particular interest . 

The clones isolated in this invention will allow better 
understanding of the relevance of Nm-specific regions in 
allowing cloning and sequencing of larger chromosomal 
fragments, and directly by their use for loci mutations. 

Finally, detection of meningococcus-specif ic genes 
possibly involved in the pathogenicity of the organism allows 
targeting of suitable antigens which can be used in an 
antimeningococcal vaccine . 

The effectiveness and the speed of the method according 
to the inventions enables it to be used in a large number of 
situations for which the genetic differences responsible for a 
phenotype peculiar to one of 2 close pathogens are 
investigated. 
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Study of the reactivity of the clones of regions 1 , 2 and 3 
towards a panel of strains of Neisseria. 

The PCR products corresponding to inserts of each of the 
clones were collected and used as probes for hybridization on 
membranes (Southern blots) for a panel of strains of Nm, Ng, 
Nl and Nc. 

Regions 1 and 2 produce a limited number of bands for 
each of the meningococci. This suggests that these regions are 
both Nn-specific and common to all the meningococci. 

Figure 3 illustrates the reactivity of the clones of 
regions 1, 2 and 3 towards a panel of neisserial strains. The 
clones of regions 1 (figure 3A) , 2 (figure 3B) and 3 (figure 
3C) taken together were used as probes towards a panel of 
meningococci, gonococci and towards a strain of Nl and Nc. 

The tracks are as follows: DNA of: Nm Z2491 in track a, 
of Ng MS11 in track b, of Nm 8013 in track c, of Ng 403 in 
track d, of Nm 1121 in track e, of Ng 6934 in track f, of Nm 
1912 in track g, of Ng WI (strain DGI) in track h, of Nm 7972 
in track i, of Nl 8064 in track j , of Nc 32165 in track k, and 
of Nm 8216 in track 1. 

Remarkably, region 3 has reactivity only with the 
meningococci of serogroup A. This region 3 is therefore 
specific to a sub-group of Nm. 

A comparison was made with the known sequences in the 
databanks in order to evaluate the possible functions of the 
cloned regions. 

Table 1 which follows gives the positions of specific 
clones on the chromosomal map and the homologies with known 
sequences . 
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Felll pyochelin receptor 
P. aeruginosa (5.10 A ) 
















Transposase 
Bacteriophage D3112 
(6 x 10 u ) 


Protein Ner-Likc 
H. influenzae (6 x 10"") 
Protein binding to the DNA 
Ner , phage mu (3 x 1 0~ 1fl ) 








Hypothetical protein 
H1 1 730 H. influenzae 
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Firstly, it can be seen that the clones of region 1 all 
correspond to genes involved in biosynthesis of the capsule. 
These genes have previously been studied among the Nm of 
serogroup B (Frosch et al. 1989, Proc. Natl. Acad. Sci. USA 
M, 1669-1673 and Frosch and Muller 1993, Mol. Microbiol. 8 
483-493) . 

With the exception of a low homology with the haemolysin 
activator of Serratia marcescens, the clones of region 2 have 
no significant homology with published sequences, either in 
the DNA or the proteins. 

Two of the clones of region 3 have interesting homologies 
with proteins which bind to the DNA, in particular the 
bacteriophage regulatory proteins and transposase proteins. 

Clone B208 has a high homology with one of the conserved 
regions in one class of receptors (TonB-dependent ferric 
sidercphor) . 

Clones B1 34 and B339 hybridize with several regions of 
the chromosome (at least 5 and at least 8 respectively) . 

Data relating to the sequences show that clone B339 
corresponds to the Nm-specific insertion sequence 31 106. 

The translation of the clone B143 has a limited homology 
with the transposase of an Aeromonas insertion sequence (SAS2) 
(Gustafson et al. 1994, J. Mol. Biol. 237 . 452-463). We were 
able to demonstrate by transfer on a membrane (Southern blots) 
that this clone is an Nm-specific entity present in multiple 
copies in the chromosomes of every meningococcus of the panel 
tested. 

The other clones have no significant homology with the 
published neisserial sequences, and furthermore nor with any 
published sequence. These clones therefore constitute, with 
the majority of the other clones isolated, a bank of totally 
new Nm-specific loci. 
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Example 3: Study of region 2 of the Nm chromosome 

. Determination and characterization of the sequence of 
region 2 

PCR amplification is carried out with the chromosomal DNA 
of strain Z2491 of serogroup A, sub-group IV-1 using 
oligonucleotide primers formulated from each of the sequences 
of clones of region 2 in several different combinations. The 
PCR products which overlap are sequenced from the 2 strands 
using the chain termination technique and automatic sequencing 
( ABI 373 or 377) . 

To prolong the sequence beyond the limits of the clones 
available, partial SauIIIA fragments of 1 5 kb of the strain 
Z2491 are cloned in Lambda DASH- I I (Stratagene) . 

The phages containing the inserts overlapping region 2 
are identified by hybridization with clones of this region as 
probes. The DNA inserted is sequenced from the ends of the 
inserts, and these sequences are used to formulate new primers 
which will serve to amplify the chromosomal DNA directly, and 
not the phagic DNA. 

An amplification of the chromosomal DNA is obtained using 
these new primers and those of the sequence previously 
available. 

These PCR products are also sequenced from the 2 strands, 
which leads to a complete sequence of 1 5,620 bp (SEQ ID No. 
36) . The reading frames of this sequence which start with ATG 
or GTG and are characterized by a high codon usage index are 
analysed. 

This analysis reveals 7 ORFs of this type which fill the 
major part of the sequence of 1 5,620 bp. The positions of 
these ORFs are the following: 
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ORF-1: 1330 to 2970 (SEQ ID No. 37); ORF-2: 3083 to 9025 (SEQ 
ID No. 38); ORF-3: 9044 to 9472 (SEQ ID No. 39); ORF-4: 10127 
to 12118 (SEQ ID No. 40); ORF-5: 12118 to 1 2603 (SEQ ID No. 
41); ORF-6: 1 2794 to 1 3063 (SEQ ID No. 43); ORF-7: 1 3297 to 
1 4235 (SEQ ID No. 44); and ORF-8: 14241 to 15173 (SEQ ID No. 
45) . 

ORF-4 starts with the codon GTG and overlaps a slightly 
smaller ORF (SEQ ID No. 41) in the same reading frame (9620- 
"2^8, frame 2), which starts with the codon ATG. 

ORF-4 codes for a protein which has structural homologies 
wzth a family of polypeptides comprising pyocins (Pseudomonas 
aeruginosa) , collcins and intimins (Escherichia coli) , which 
are bactericidal toxins (pyocins, collcins) or surface 
proteins involved in adhesion of bacteria to eukaryotic 
proteins. ORF-7 encodes a protein, the sequence of which 
contains a potentially transmembrane region and which has 
structural homologies with periplasmic proteins or proteins 
inserted in the external membrane of bacteria. The structural 
homologies of ORF-4 and ORF-7 have been identified with the 
aid of the PropSearch program. 

Investigation of sequences homologous to other ORFs in 
GenBank with the aid of the BLAST program revealed a homology 
between the N-terminal regions of ORF-2 and filamentous 
haemagglutinin B of Bordetella pertussis (43% similarity, 36% 
identical over 352 amino acids) and between ORF-1 and the 
protein fhaC of Bordetella pertussis (35% similarity, 27% 
identical over 401 amino acids) . ORF-1 and ORF-2 are 
neighbouring genes in the strain Z249I and filamentous 
haemagglutinin B of Bordetella pertussis and fhaC are 
neighbouring genes in Bordetella pertussis, which reinforces 
the probability that these homologies reflect functional 
homologies . 
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Confirmation of the specificity of region 2 with 
respect to Nm 

Southern blots are carried out using the DNA probes 
obtained by PCR amplification of various parts of region 2 
using oligonucleotide primers formulated from sequences of 
clones of region 2. 

The approximate position of these oligonucleotides is 
shown on figure 4 . 

These are the oligonucleotides called R2001 (SEQ ID No. 
46) and R2002 (SEQ ID No. 47) in one half of ORF-1 , the 
oligonucleotides b332a (SEQ ID No. 48), e1 39a (SEQ ID No. 49), 
b132a (SEQ ID No. 50) and b233b (SEQ ID No. 51) in one half of 
ORF-1 -the majority of ORF-2, and the oligonucleotides e145a 
(SEQ ID No. 52) and blOla (SEQ ID No. 53) in 1/3 of ORF-4+ORF- 
5 to 7. 

The three Southerns are carried out under the following 
hybridization conditions: 
16 h at 65^C, 
NaPO, 0.5 M, pH 7.2 
EDTA-Na 0.001 M 
1% sodium dodecylsulphate . 

For the washing, heating is carried out for 10 min at 
65^C, and NaP0 4 0.5 M, pH 7.2; EDTA-Na 0.001 M, 1% sodium 
dodecylsulphate are used. 

Figures 5, 6 and 7 respectively show the Southern blots 
obtained with each of the abovementioned ORF parts. 

The 14 tracks correspond respectively, in each of the 
Southerns, to 
1 : MSI 1 (Ng) 
2: 403 (Ng) 
3: FA1090 (Ng) 
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4: W1 (Ng) 
5: 6493 (Ng) 

6: marker (lambda hindlll) 
7; Z2491 (Nm # gpA) 
8: 7972 (Nm gpA) 
9: 8013 (Nm, gpC) 

10: 1121 (Nm, grouping not possible) 
11: 1912 (Nm, gpB) 
"3: 32165 (Nc) 
14: 8064 (Nl) . 

Given that a panel of strains of Neisseria is used in 
these experiments and that each well is charged with a similar 
amount of digested DNA, these 3 Southern blots clearly show 
that the sequences corresponding to region 2 are found in all 
the meningococci tested and that significant homologous 
sequences do not exist in the genome of the Ng of the strains 
tested. 

Example 4jl Identification of regions of the Nm genome absent 

from Nl and common with Ng 

The technique described in example 1 is followed, but the 
chromosomal DNA of one strain of Nm (Z2491 ) and 2 strains of 
Nl (XN collections) , equal parts of the DNAs of which are 
mixed, is used. 

2 subtractions are performed using the R and J series of 
primers. Three different banks are thus obtained. 

Two banks, called Bam and Eco, are obtained respectively 
by digestion of the chromosomal DNA of Nm Z2491 by Mbol and 
Tsp5091; a third bank, called Cla, which results from 
digestion of the chromosomal DNA of Nm by Mspl , is obtained 
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RMsp24, JMspIO and JMsp24. All 
the following table 2. 

Table 2 



Adapters for differential banks 

Chromosomal DNA digested by Cloning in 
pBluescript by 

Mbol —> BamHX 

Tsp509l -» EcoRI 

Mspl Clal 

First subtraction cycle 

RBaml 2 : 3' AGTGGCTCCTAG 5' (SEQ ID No. 54) 

RBam24 : 5* AGCACTCTCCAGCCTCTCACCGAG 3 r (SEQ ID No. 55) 

REcol2 : AGTGGCTCTTAA (SEQ ID No. 56) 

RBam24 : 5' AGCACTCTCCAGCCTCTCACCGAG 3' (SEQ ID No. 55) 

(REco 24 = RBam 24) 
RMsplO : AGTGGCTGGC (SEQ ID No. 57) 

RMsp24 : 5 T AGCACTCTCCAGCCTCTCACCGAC 3 T (SEQ ID No. 

58) 

Second subtraction cycle 

Jbaml2 : 3 1 GTACTTGCCTAG 5' (SEQ ID No. 59) 

JBam24 : 5' ACCGACGTCGACTATCCATGAACG 3 f (SEQ ID No. 60) 

JEcol2 : GTACTTGCTTAA (SEQ ID No. 61) 

JBam24 : 5* ACCGACGTCGACTATCCATGAACG 3' (SEQ ID No 60) 

(JEco 24 = JBam 24) 

JMspIO : GTACTTGGGC (SEQ ID No . 62) 

JMsp24 : 5 1 ACCGACGTCGACTATCCATGAACC 3* (SEQ ID No. 63) 
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After 2 subtractions, the entire product of each 
amplification is labelled and used as a probe. 

The subtractive banks are checked by Southern blotting 
over a panel of 1 2 strains of Neisseria (chromosomal DNA cut 
by Clal) . The hybridization conditions are identical to those 
given in example 1 . 

These Southern blots are shown on figures 8A to 8C, which 
relate respectively to the MboI/BamHI bank, to the Mspl/Clal 
bank and to the Tsp5091 / EcoRI bank. 
The 12 tracks correspond respectively, to 
1 : Nm Z2491 (group A) 
2: Nl SC64 

3 : Nm 821 6 (group B) 
4: Nl 9764 

5: Nm 801 3 (group C) 
6 : Ng MS1 1 

7 : Nm 1912 (group A) 
8: Ng 4C1 

9: Nm 1121 (grouping not possible) 

10: Ng FA1090 

1 1 : Nc 321 65 

12: Nm 7972 (group A) 



Examination of the Southern blots shows that the 
sequences contained in each bank are specific to Nm and are 
not found in Nl . Furthermore, the reactivity found with the 
strains of Ng suggests that some of these sequences are 
present in Ng. 

Each of these banks was then cloned in pBluescript at the 
BamHI site for Bam, or the EcoRI sit for Eco, or the Clal site 
for Cla. In order to confirm the specificity of the clones 
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with respect to the Nm genome, restriction of the individual 
clones and radiolabelling thereof were carried out. The clones 
showing reactivity for both Nm and Ng were kept for subsequent 
studies. These clones are shown on figures 9, 10 and 11, which 
give the profiles with respect ot Nm, Nl and Ng of 5 clones of 
the Bam bank (figure 9), 16 clones of the Eco bank (figure 
10) and 13 clones of the Cla bank (figure 11). 

These clones were sequenced using universal and reverse 
primers . They are 

- Bam clones 

partial B1 1 of 140 bp (SEQ ID No. 66), partial B1 3 estimated 
at 425 bp (SEQ ID No. 67), B26 of 181 bp (SEQ ID No. 68), B33 
of 307 bp (SEQ ID No. 69), B40 of 243 bp (SEQ ID No. 70), 

- Cla clones 

C16 of 280 bp (SEQ ID No. 72), partial C20 estimated at 365 bp 
(SEQ ID No. 73), partial C24 estimated at 645 bp (SEQ ID No. 
74), partial C29 estimated at 245 bp (SEQ ID No. 75), C34 of 
381 bp (SEQ ID No. 76), C40 of 269 bp (SEQ ID No. 77), C42 of 
203 bp (SEQ ID No. 78), p C43 of 229 bp (SEQ ID No. 79), C45 
of 206 bp (SEQ ID No. 80), C47 of 224 bp (SEQ ID No. 81), C62 
of 212 bp (SEQ ID No. 82), and C1 30 (5'...) estimated at 900 
bp (SEQ ID No. 83) , and 

- Eco clones 

E2 of 308 bp (SEQ ID No. 84), partial E5 estimated at 170 bp 
(SEQ ID No. 85), partial E22 estimated at 300 bp (SEQ ID No. 
86), E23 of 273 bp (SEQ ID No. 87), E24 of 271 bp (SEQ ID No. 
88), E29 of 268 bp (SEQ ID No. 89), partial E33 estimated at 
275 bp (SEQ ID No. 90), partial E34 estimated at 365 bp (SEQ 
ID No. 91), E45 of 260 bp (SEQ ID No. 92), E59 estimated at 
greater than 380 bp (SEQ ID No. 93), E78 of 308 bp (SEQ ID No. 
94), E85 of 286 bp (SEQ ID No. 95), E87 of 238 bp (SEQ ID No. 
96), partial E94 greater than 320 bp (SEQ ID No. 97), partial 
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E103 greater than 320 bp (SEQ ID No. 98) and E11 0 of 217 bp 
(SEQ ID No. 99) . 

Mapping of each clone was carried out on the chromosome 
of Nm Z2491 as described in example 1 . The results obtained 
are given on the right-hand part of figure 2. It is found that 
these clones correspond to regions called 4 and 5. These 
regions are therefore made up of sequences present both in Nm 
and in Ng, but not found in Nl . It is therefore regarded that 
these are sequences which code for virulence factors 
responsible for the initial colonization and penetration of 
the mucosa. Region 4 is located between argF and regF on the 
chromosome of Nm 2491 [sic] , and region 5 is located between 
the iar.bda 375 marker and penA. This region probably contains 
sequences which code for an Opa variant and a protein which 
binds transferrin . 

A comparison with the known sequences in the databanks 
has half [sic] that in region 4 only the clone C1 30 has a 
homology, that is to say with MspJ methylase. In region 5, no 
homology with known sequences was found with the clones C8, 
E2, B40, C45, E23 and E103. For the other clones, the 
homologies are the following: 

B1 1 arginine decarboxylase SpeA; C29 arginine 
decarboxylase SpeA; C62 oxoglutarate/malate transporter; 
repetitive DNA element; E34 repetitive DNA element; E94 murine 
endopeptidase MepA ; C47 citrate synthase PrpC; E78 citrate 
synthase PrpC 

Example 5: Demonstration of the presence of one or more 

strains of Neisseria meningitidis in a biological sample 

A biological sample of the cephalorachidian fluid, urine, 
blood or saliva type is taken. 

After filtration and extraction, the DNAs present in this 
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sample are subjected to gel electrophoresis and transferred to 
a membrane by Southern blotting. 

A nucleotide probe constructed by labelling SEQ ID No. 5 
with is incubated with this transfer membrane. 

After autoradiography, the presence of reactive band(s) 
allows diagnosis of the presence of Neisseria meningitidis in 
the sample. 

Example 6 : Vaccine composition including in its spectrum 

antimeningococcal prophylaxis and intended for prevention of 
any form of infection by Neisseria meningitidis. 

The peptide coded by a sequence including SEQ ID No. 10 
is conjugated with a toxin. 

This conjugated peptide is then added to a composition 
comprising the anti-Haemophilus and antipneumococcal vaccine, 
or any other childhood vaccine. 

After having been sterilized, the resulting composition 
can be injected parenterally , subcutaneously or 
intramuscularly . 

This same composition can also be sprayed on to mucosa 
with the aid of a spray. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION : 

(i) APPLICANT: 

(A; NAME: I.N.S.E.R.M 

(3) STREET: 101, rue de Tolbiac 

(C) CITY: PARIS CEOEX 13 

( E } COUNTRY: FRANCE 

;f; POSTAL COTE (ZIP; : 75654 

',11) TITLE OF THE INVENTION: DNA, specific proteins and peptides of the 
Neisseria rremr.gitidis species oacrena, methods for obtaining ihe~ and 
their biological applications. 

(in) NUMBER OF SEQUENCES : 99 

[iv; COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
(3) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (OEB) 



(2) INFORMATION FOR 3EQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 257 base pairs 
(3) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 
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(B) STRAIN: Z2491 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GA7CCGC7GC CGGCAGACGA AT A7 C AAGAC ATCTTCGATT TTATGAAACA GTATGACTTG 60 

TCTTACCCGT ATGAATATCT GCAGGATTGG ATAGATTACT ATACGTTCAA AACCGATAAG 120 

CTGGTATTTG GTAACGCGAA GCGAGAGTGA GCCGTAAAAC TCTGAGCTCC TGTTTTATAG 180 

ATTACAACTT TAGGCCGTCT TAAAGCTGAA AGATTTTCGA AAGCTATAAA TTGAAGCCCT 24 0 

TCCACAGTAC ATAGATC 2 57 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 6 case pairs 
(3) TYPE : nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GATCATGTTC AAATAGATAG GC AT GGGAAG CTGCAGCTCT AACGTCCATG AAAATATGTT 60 

GCATAGCTGC AAGCGGAACG CCTTTTCTTT CATCTACATA ATCTATAGAG TCAAGGCAAC 1 20 

CGCTATTGAA ATTAGCAGTA TTGCCTATGA TTACATTAGT AATATGCTCA TACCATTTTT 180 
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GGGTGGT CAT CATATTGTGC CCCATTGTTA TCTCCTTATA TTGGTTTTAG AAGGAACTTT 24 0 

GACAGGAAGA ATAACGGCCT TACCTGTTTG ACGATC 27 6 

(2) INFORMATION FOR SEQ ID NO: 3: 

{-) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 423 base pairs 
[3) TYPE: nucleotide 

(C) STRANTEDNESS : single 

(D) TOPOLOGY: linear 

MOLECULE TYPE: DNA (generic) 

(vi ) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 
(3) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ TO NO: 3: 

GATCTGGTGG TGTTTGCACA GG^AGGCGCA TACTTGTTCG GGACTGAGTT TGCGGCGGAT 60 

AAGGGTGTCG ATGTGCTGAA TCAGCTGCGA ATCGAGCTTA TAGGGTTGTC GCTTACGCTG 120 

TTTGATAGTC CGGCTTTGCC GCTGGGCTTT TTCGGCGCTG TATTGCTGCC CTTGGGTGCG 180 

GTGCCGTCTG ATTTCGCGGC TGATGGTGCT TTTGTGGCGG TTAAGCTGTT TGGCGATTTC 240 

GGTGACGGTG CAGTGGCGGG ACAGGTATTG GATGTGGTAT CGTTCGCCTT GGGTCAGTTG 300 

CGTGTAGCTC ATGGCAATCT TTCTTGCAGG AAAGGCCGTA TGCTACCGCA TACTGGCCTT 360 

TTTCTGTTAG GGAAAGTTGC ACTTCAAATG CGAATCCGCC GACCTCTTTC AGTTACAGCA 420 

GCTTGATC 428 

(2) INFORMATION FOR SEQ ID NO: 4: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 390 base pairs 
(3) TYPE: nucleotide 

(C) STRANDZDNZSS: single 

(D) TCPCLCGY: linear 

(ii) MOLECULE TYPE: TXA {generic) 

;v:: ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 
(3) STRAIN: Z2451 

(xY SEQUENCE LESCRIPTION: SEQ ID NO: 4: 

G-- - CC - G^A _ _' GACA _ C G G I G — o-GG _ G - G AGGGTATT GT GKCCGrO i AaA. G .±. CGGCA _ lA c 

CCGTTGGCCA ATAAGGATAG ATGAGCGTCT GC A GAAAC AG CATGAAGGCC GTCTGAAACG 120 

AT ATT GO GOT GCAATGCGGT GGTTTGGAGA GCCTTGGCTG CGTTCAGCTT GGTATTGCGA 180 

AGCTGAATAT TGCCTTTGGC TGCCTGAATG TGCAGATTAC CCGAGTTGGT ACGCAGATTG 240 

GT AT T G GT AA CATTCAGCAA GCCTGCCTCC ACACCCATGT CTTTTGAGGC AGTGAGGGTT 3CC 

TTACTGGTGC CGGTAATATG GGCAGCGTTA TCCGATTTCA AATGGATGCT GGCCGGCAGA 360 

CAAATCTTTA TCAACATTCA AATTCAGATC 390 

[2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 177 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(vi) ORIGIN : 

(A) ORGANISM: Neisseria meningitidis 
(3) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GACCAGA7TG GTG.AAGACGG TAITACCGTC AATGTTGCAG GCCGTTCGGG A7A7ACGGCG 

AAAA 7 C G A_ C G 7G7C7CCGAG 7ACCGA777G GC GG777A7G GCCATATTGA AGTTGTACGG 

GGIGZ^ZGG GGTTGACCCA A7CG.AG.7rCA GAGCCGGGTG GAACCGTCAA T77GA7C 

(2) INFORMATION ECR SEQ ID XC : 6: 

(-) SEQSZ^CZ CHARACTERISTICS: 

'A v LENGTH: 34 ^ base pairs 
:b; TYPE: nuclec-ide 
:c; STRANDEDNESS: single 
;D; TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 
(3) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 6: 

GATCAATGA7 GCTACTATTC AAGCGGGCAG TTCCGTGTAC AGCTCCACCA AAGGCGATAC 

TGAATTGGGT GAAAATACCC GTA7TATTGC T GAAAACGT A ACCGTATTAT CTAACGGTAG 

TATTGGCAGT GCTGCTGTAA TTGAGGCTAA AG AC ACT GC A CACATTGAAT CGGGCAAACC 

GC7TTCTTTA GAAACCTCGA CCGTTGCCTC CAACATCCGT TTGAACAACG GTAACATTAA 

AGGCGGAAAG CAGCTTGC7T TACT G'GCAGA CGATAACATT ACTGCCAAAA CTACCAATCT 
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GAATACTCCC GGCAATCTGT ATGTTCATAC AGGTAAAGAT C 341 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 164 base pairs 
(3) TYPE: nucleotide 
(C) SIRANDEDNESS: single 
(3) TOPOLOGY: linear 

(vi ; ORIGIN: 

;a; ORGANISM: Neisseria meningitidis 
(3) STRAIN: Z249^ 

(xi ) SEQUENCE TESCRIPTICN: SEQ ID NO: 7: 

GATCCAACTG TTTGATTTTA CTGGCTGCTT CTCCATGCGC GGTATTGACC AAAGCCGCAA 60 

GGATATTCGC TTCCAGATTG TCTTTCAGGC TGCCGCCGTT GACAGCGGTA TTAATCAGTG 20 

CGGCACTGCC CGCATTGGCT AGGTTGACGG TCAGGTTGTT GATC 164 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 base pairs 

(B) TYRE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 8: 
GATCAATCAC ACATCTTGTC ATTTITTCGA TTCCTTCATT TCGGTTTCTA ATGTTTCAAT 60 
TCTTGCGGCC ATTTCCTGAA TGGCTT7AGT CAAAACGGGG ATGAACGCTT CGTATTCGAC 120 
GGTGTAGGTA TCG7TTGTTT TAITIACCAI CGGCAATCGA CCATATTCAT CTTCCAGCGC 1 80 
AGCAAIG7CC TGGGCAA7AA ACCAA7GCCG CAACCGATC 219 
(2) IXF0RXA7ICX FOR SEQ 10 NO: 9: 




<vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GATCTTGGGT AAGCCCCCAA CCTGCATAGA AAGGCAGGCC GTAGCAGCTG ACTTTTTTGC 60 

CGCGCAACAA GGCTTCAAAA CCGGTCAGCG AAGTCATGGT ATGTATTTCG TCTGCGTATT 120 

GGAGACAGGT CAGGATGTCG GCTTGTTCGG CGGTTTGGTC GGCATATCGT GCAGCATCAT 1 80 

CAGGGGAAAT ATGGCCGATG CGGTTACCGC TGACTACATC GGGATGCGGT TTGTAGATGA 240 

TATAGGCATT GGGGTTTCGT TCGCGTACGG TACGGAGCAA ATCCAGATTG CGGTAGATTT 300 

GGGGCGAACC GTAGCGGATA GACGCATCAT CTTCAACCTG GCCGGGAACG AGGATC 3 56 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 base pairs 

(3) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(::) MOLECULE TYPE: DNA ; genomic) 

'vi) ORIGIN: 

;A) ORGANISM: Neisseria meningitidis 
(3) STRAIN: Z2431 

;x:) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GATGLGCTTT CAGTTTCCGT ACCGGTGG OA TCAGTCAAGT CCGTTTTGTG CACCAAACCG 

CGTCOATATG AAA C AT AAAA CAAATCGCTT AAGCCCAAAG GGTTATCGAA CGATAAAGOG 

ACATTTCCTT GATATTTGCC GGTCGTTTTG CCGCCCGC AT CATCTATACC GATACTGAAC 

CGT AT GGGTT TATTCTGCTG CCATTTGATC 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 259 base pairs 
(3) TYPE: nucleotide 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN : 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GAT C C C G AAA CGCAATTGGC C G AAA GC TAT ATGCTGAACG ATGTGTTGCG GTTTTGGGAC 

AGCGCAGGTT TGGGCGATGG C-AAAGAAGCC GACCGCGCCC ATCGGCAAAA ACTGATTGAT 

GTCCTGTCTA AAACCTATAC TCATTCGGAT GGGGAGTGGG GCTGGAT AGA TTTGGTGTTC 

GTTATCCTTG ACGGCAGCTC CCGCGATTTG GGTACGGCCT ATGATTTGTT GAGGGATGTT 

™i _ C w — A- - — - — - - _ vj:.- GA _ C 

(2) INFORMATION FCR SEQ IT NC : 12: 

;i) SEQUENCE CHARACTERISTICS: 

A LENGTH: 4 36 bass pairs 
' B , TYRE: -u;lec::de 
,C, STRANTETNESS: single 
I TCP CI CUE* : linear 

(n; MOLECULE TYRE: UNA ( genomic) 

(vi ) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 
[3] STRAIN: 22491 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GATCAAATGG A T GAT T T ATA TAGAATTTTC TTTTACGACT GCGTGCCGTT TGAAAAGAAA 

ATGCACAATC CCGTATCTCA TCGTGCCATA GATTTTTCAA AGACTCCGGA AGCCATATTT 

CGTTGCAATC TGCATACCGA ATTGAAGAAG AAGCGTAAAT TAGCGTTACG TTTAGGCAAG 

CTGTCGGACA ATACAGCATG GATATTAAAA CCCCAAGTCA TGAAAAATCT TCTGAAAAAC 

CCGTCAACTC AAATTACGGA AAACGATGTC GTGCTCGATG TTAAACAAAA AGGTGTAGAT 
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ATGCGTATAG GCTTGGATAT TTCATCTATT ACCTTAAAAA AACAAGCCGA TAAAATCATC 360 

TTGTTTTCTG GTGATTCCGA TTTTGTCCCA GCAGCCAAAT TAGCCAGACG GG AAGGT AT C 4 20 

GATTTTATTC TTGATC 4 36 

(2) INFORMATION FOR SEQ ID NO: 13: 

I.) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 3 base pairs 
(3; TYPE: nucleotide 
(Ci STRANDEDNESS: single 
(D) TOPCIOGY: linear 




(xi) SEQTENCZ DESCRIPTION : SEQ ID NO: 13: 

GATCGTTTTA CGTCGCAATC GAGCTTTGTG GTGCGCTCGC CTAAAAGCCA ATCTTCTCTC 60 

AAT GGCCTGG GTGCCATTTT GCAGGGC AC A GGTTTTGCCC GTGCGCAAGA CGATATTTAT 120 

ACCGTGCA GG AATATATGCA GTCGCGTTCG GCTTTGGATG CGTTGCGTAA GAAAAT GCCC 180 

ATTCGCGATT TTTATGAAAA AGAAGGCGAT ATTTTCAGCC GTTTTAATGG TTTTGGCCTG 240 

CGTGGCGAGG ATGAGGCG7T TTATCAATAC TACCGTGATA AGGTATCCAT CCATTTTGAC 300 

TCTGTCTCAG GCATTTCCAA TTTGAGCGTT ACATCGTTTA ATGCCGGTGA ATCTCAAAAG 360 

ATC 363 
(2) INFORMATION FOR SEQ ID NO: 14: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 314 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(n) MOLECULE TYPE: DNA (genomic) 

(vi ; ORIGIN: 

(A, ORGANISM: Neisseria meningitidis 
(3) STRAIN: Z249 1 



'xi) SLQLZNCL DESCRIPTION: SEQ ID NO: 14: 
GATCTTGCGT CAT I TAT ATI TT CATC GAT A TTGCAATTAC CGCCGTTCCA GTTGAAATAA 6C 



:tcctat tcttgcgtac catttcccaa 12: 



TAATTGCGCC COACAATTTC CATTTAATGC TCCATCAGTT CTTTTACTTC CGGAAATCTG 1 SC 



CTGTAATCTG A C AT AAGAC G CAT AAT T GAA CT AT CAACGC CGTAACAGCC ATAGGTTTTA 24; 



ATACCGTTTT CGGCGTGTTC CCAAATGCAA TTACTGTATT CGTAGCCTTT TACAAATTTA 30 0 



m 

M 

m 

m 

S 

c 

4S 

III 
nj 

C TCGGTTTCGG GATC 314 



(2) INFORMATION FOR SEQ ID NO: 15: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 256 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 
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(B) STRAIN: Z2491 
(x:) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GAT CAT AC GA ATCTACCCTA AAATACCCCG TCGCCGATTT AGGATTGGCT ACATAAAGCT 
CATTATAAGG GTATTTTGAT GACATGATAC GGTTAAATTC ATTGCCGTTG TTTATCCTGA 
T T C TAT AAA T TGGTTCAACA GCAAAGCCTC TGGATTCCCT TAATTGATTA TAATATTGCC 
TGTATGTTTG TACATCAT3T CTTGTCCACG GCTCTCCAGG AGTCCTCAGA ATAGCAATCC 
C G _ T AAn. C G G A _ C 

[2] INFORMATION FCR SEQ IT NC: -6: 

'A 1 DENOTE: 225 case pairs 

(5) TYPE: nucleotide 

;C; STRAINDZDNZSS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: UNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GATCCACGCC TGTGCCTACC TTGGCTTTTT GTTCGCCAAA CAAGGCATTT AAGGTTGAGG 
ACTTGCCGAC ACCTGTCGCA CCGACAAGCA AG AC AT CC AA ATGACGGAAA CCGGCTGCTG 
TGACTTTTTG CCCGATTTCA GAAATACGGT AACGATGCAT ATGCGCTCCT AC C AGCC AAA 
AAAAGAAGCA ACCGTGCTAA TCGCCCCTCC AATCGCTTTT GCAGCACCGC CGATC 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 259 base pairs 

(B) TYPE: r.uclectide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: ON A ( genomic) 

{v.; ORIGIN: 

( A ) ORGANISM; Neisseria rer.irgit idis 
(5) STRAIN : Z2491 

Ixi; S1Z"^CZ DESCRIPTION: SEQ IT NO: 17: 

GATOCAAOGG GOATOGCTGT COTTASTCGG TGTGGTTTGA OOGCTGATTT GTCCTTOTTC 

GTOAACTT CT ATGGOCTGAC GCTGTTTGCT GGGGGGGGTC TGGATAATGG TGGCATCAAC 

GAGGGCGGZG GAT GOT T TOT OTATTTTTAG GCCTTTTTCG GTCAGTTGGC AGTTAATCAG 

ITTGAGTAAT TCGGACAGGG TGTCGTCTTG CGCCAGCCAG TTGCGGTAGC GGCATAAGGT 

AOTGTAATCG GGGATGATC 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 201 base pairs 
(3) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

<ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 
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(B) STRAIN: Z2491 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

GATCTGTGCC GTTGATTTTA TCTTTCAGAT GCAGCATCGA ATATCGGAAA GCCAAATCAG 60 
CAATTCTTTT TGCATCGTGT GGAT7TTGAG ACGGGCCTAA TGACCGTACC CGCTTAATAA 120 
AAAAT GCAZC GZ C AAL C AAA ATGGCGGLTT TCATATTGCT TCCCCTATAT TTGT CAAAGA ISO 
L AT AAAAAA 3 CCCTTGGGAT C 2C1 
[2] INFORMATION FOR SEQ ID XO: 1 9 : 

lA) 1EXGTH: 3 34 base pairs 
[3] LYRE: nucleotide 




(u) MOLECULE LYRE: DXA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 
(3) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

AATTCAAAGG AGGCATTTGT TGCAAGAAAA GTACAAAGTG ATTTGCAAAA AGCATTGAAT 60 

GCTAGCAACL ATAACAAGCA GCAATATGCA AGACGTGCGG CAACAGCGTT AGAGAATGCT 1 20 

TC AAAAT CAA AAGTTATGGC AGCGAATTCT TTTTGATCTA TCTTGTGCGA ACGGGTCAAA 1 80 

TATTCTTCGT ACATTGAGTT AATCGTACCA ATCGCCCTAA CCACATTTTC AT C AG AAAAT 240 

ATGGAAATAA TAGCATCCCT ATACGCACCT AGTGTAATAT TGTTTCTATT ATTAGTTATA 300 
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GCATTATTCG AATACATAAT AGCACCTCCA AATT 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23S base pairs 
(3) TYPE: nucleotide 
(C) STRAND HONE SS : single 
(T; TOPOLOGY: linear 

(ii) MOLECULE TYPE: IX A (genome) 

(vi ) ORIGIN: 

(A; ORGANISM: Neisseria meningitidis 
3; STRAIN: Z24?' 

(xi SEQUENCE DESCRIPTION: SEQ 10 NO: 2C: 

AATT OCT GCG CACCTTTGCC GATGGGGAGA TAATCGCCTT TTTC-CAGCAT TCTGCCCTGA 

TGGCCGCCGA AACCGGCTTT CAGGT CGGTA CTTCTCGAAC CCATCACTTC CGGCACATCA 

AATCCGCCCG CCACGCACAC ATAGCCGTAC ATGCCCTGCA CGGCACGCAC CAGTTTCAAG 

GTCTGCCCTT ZGCGGGGGGT ATAACGCCAA TACGAATAGA CCGGTTCGCC GTCCAATT 

(2) INFORMATION FOR SEQ ID NO: 21: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 249 base pairs 
(3) TYPE : nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 
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(3) STRAIN: Z2491 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
AATTGGGCGA GATGCTGCCG GAAACGGATT TAAAACAGAT TGCGGCGGCA GTGTTGAAGA 
CGAACGATGA GGCGGCAI7G CAGAAGGTGG TGAAAACGGC CAAAGGCAAT GCGCGGAAAC 

- - ^ ^ - Sv- _ vj^_ _ Grr . — ^_^G~:^.lA — J.GTTGCAGGT ^AACCC_ GA G GA TGG 

AT GAT GAT GT AA T C G AA Z A G GCGGAAACCT ATTTAATCCA CIAAACCTTT G A G AG AT AAG 




— , ^ — ^^-.NwZ. G ~_j-_rLrLv^ _ z.r'L— w Cc2 : 

A ZZ^3Z~~: I' 2 case cairs 

;B, TYPE: nucleotide 

[G^ STRANGEGNESS: single 

G; TGPG1GGY: linear 

(n) yCTZCULZ TYPE: GNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 
(3) STRAIN: Z2491 

(xi) SZQ'ZZSCZ DESCRIPTION: SEQ ID NO: 22: 

AATTTATGTA CGGTTTTGCC GTTTGCAGTC AGCCAGTCGG CAAGGCGCAG AAAAAAATCG 

CCGACAGGGC CTTGAAGCAG CAGGATATTT TCTGCGCTTT CAAGCAGGTT TTGGAGGTTA 

TTTTTGAGGA CGGTCTGTTT CATGTTGCAA TGTGGTTTTG TTTTTTATGT AATAGTTTTA 

GGTTGAACTT TCAAGCATAC GCCAAGAGAA TT 
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(2) INFORMATION FOR SFQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 base pairs 

(3) TYPE: nucleotide 

(C) STRANDEDNZS3: Single 

(D) TOPOLOGY: linear 

i::) M0LECC1E TYPE: TNA ( genomic ; 

■vi) ORIGIN: 

;a: ORGANISM: Neisseria meningitidis 
(3) STRAIN : Z2491 

AA. CA.G_ GC C-c^G_CA_n _CAOGG- — — — ^ _ — ^- AGGGTTACTG TAT CGCCCGC 

G GC A _ C G A 0 G GC C AA _ _ G^AGC CA-^j GGAG-^GG _ GC - GCGGGGCGG A.G^cvj. i 

TTGGATGGAT TGGGCGCGTT TGGACTGAAT CACGGGCZGC AAGGCTTGCT CGGCGTACTG 

TTTGGCCAGT ACTTCGATGC GCTTTAAATG CTTTTGGCGG CGCAATT 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 base pairs 
(3) TYPE: nucleotide 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN : Z2491 
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(xi ) S3QUENCE DESCRIPTION: SEQ ID NO: 24: 

GATCCAGGAC T C AAAAA C CG ATTTCCTAA? AGAGTGTCTA ATATCCCAAT CTTTTTTACC 

CCCCCCGCIG IAGAACTGAT agagaaagtt TGTCTATCTT TTTCATATAC ccazgccztc 

IT ITT AT CAT T GC A.GCT AAC AG AACC GC CA AACAATGCIT CI AG AT C 

12) INFCRVATICN ECR SEQ ID NC: 25: 

'i; SEQUENCE CHARACTERISTICS : 

'A, LENGTH: 2 51 bass pairs 
(3: TYPE: r_ucleotz.de 
(C; STRANDEDNESS: single 
Z T C P C 1 C G Y : linear 

MC1ECULE TYPE: UNA ; genome) 

'vil CRI GIN: 

;A X ORGANISM: Neisseria meningitidis 
(S; STRAIN : Z2491 

{xi; SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

AATTCTTGCG GCCATTTCCT GAATGGGTTT AGTCAAAACG GGGATGAACG TTTCGTATTC 

GACGGTGTAG GTATCGTTTG TTTTATTTAC CATCGGCAAT CGACCATATT CATCTTCCAG 

CGCAGCAAT G TCCTGGGCAA TAAACCAATG CCGCAACCGA TCTTCTTTAT GACTGCCGTC 

CTTGATTGGA TTCGCCCACC ATTCGCGGAC TTTGTCCGCT CGTTCATCTG CCGGCAAGTC 

TTT GAAT AA7 T 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 base pairs 
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(B) TYPE: nucleotide 

(C) ST HANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN : 

(A) ORGANISM: Neisseria meningitidis 
(3; STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

AATTCCCGAC TAT CGCGGAT GCGTAGTTTT TGCCGGTGGG CAAGAGCAGG TGTGGGATAA 6 0 

GTTAOGTGAT TTGCCCGATG OCGTCAGIIT Z--ZZZZZZZZ GAATCGGTAA AT ATT GACGG 1 2C 

CTTAAAATCC G T AAAA C T C G T C GC AT T AAA ZGCTGCCGZZ CAGGCTTTTA TTAACAAGCA ISO 

CGCCGGTATC GACAGCGTAC CIGAATT 2C7 

(2) INFORMATION FOR SEQ ID NO: 27: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 379 base pairs 
(3) TYPE: nucleotide 

(C) STRANDSDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
AATTGTTTGG GAATAATCCA AACAAACAGC ATCAGGATAG CGGCGGCGGT CAGGCTGCCT 60 
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GAAAGGATTT TGCCGGGGTT TTTTGTAGGC AAAGCGGACG AGAAACCAAA GCAACAGCAG 120 

CATGGTGTCC CAATAGCCGA TTGAGAATAG GATGGCCAAA CCTTCTAGGA AATGGCGTAA 180 

ATCG7ITGTG GTAACCATGG GTAG77CCTG TGGTTAAATG TGCAGGCTGC TTTTTGCCGA 240 

-ZZZZGZZGG A7C7CAAAAG CAGGZZZZGZ 77CAGCGT7G CG77ACGCAG 7AAAA7AA7G 3CC 

AA7A777G7A ACGGC77GGG 7A777777G7 CAA7A77CCC GCCC7TCCC7 7AACAGC7GC 360 




(v:) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

AATTCGCCGA AATCAGGC7G CTGC7CGA7A A7CGGCGCGG CCGATTGGCG TTGTGCCTCG 60 

ATTAAATCCA TCTTGTCTTG CAGACG77TG GCCTGGCCTT TGCGGCGGCG TTCGGCCAGT 120 

TGTTCCATCC GCGTTTCCGG AAATGCCGCC CGTTTGTTGC CGTTGAATAC CGCTTTGCAA 180 

ATCACCTTGC CCTGCATATC CT7CACAATC ACATGGTCGG CATCGTGGAT GTCGTAAGCC 240 

ACCCGTACCT TCTGACCGCT GTAA7CCAGC AATT 274 
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(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 263 base pairs 
;B} TYPE: nucleotide 
(C) STRANDEDNESS: single 
(T) TOPOLOGY: linear 

.11, MO IE GYLE TYPE: TNA ^ genomic) 

; A; ORGANISM: Neisseria meningitidis 
[3; STRAIN: 22491 



W >:: SE;YE::CE TESSRIPTICN: SEQ IT NO: 29: 

i 

fU 

1^ AATTSSGTTO TTATTGGGCT TITTOOATCC AT CGGGT AT G CCTGAAGGGA ACGCAAACCC 60 

ic- 

SJ 

IJ] 



nj 
ru 



:CA0TTG0 SGATCGCTOO ATTOCGGOAT TAGCGCGTCT GACGGCAAGT GTTCTCGCGC 120 



GGAA-CAAGC CACGCCGCC GCATTGOGGC CTTGTCCTGC TGAAAACTTC GCAGTGCTTT 180 

TGOAAOCGGO C OAT CAT T AA CTTCAATCAA ATAAATCATT ATATTTGCGT TCATTTTTCC 24 0 

T AOA 00 — 03 0 OA OAT 0 0 AA ATT 263 

(2) INFORMATION FOR SEQ ID NO: 30: 

[-) SZQ-JSXCZ CHARACTERISTICS: 

(A) LENGTH: 316 base pairs 
(3; TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 
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(E) STRAIN: Z2491 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
AATTGTTCAA GAAAAAAGTC GGCACGGCGC GGCAACGGGG AAAATGCGTT GACGCCGTCT 60 
TTTTCTAAGG TGATGTAGTA GGGGCGGAAA TAGCCTTCTT CAAACGCCCA GAAACTGGCT 120 
CGGTTTTCGC TTGCAA7GCG T7TTGCAATG ACGTGATAAG GGCGTGTGTC GCCAAAGCAG 180 
ACAACGGCCT GGATGTGATG TTGAGTGATG TATTCTTGCA AAAACT CAGG AAAGGCGTCG 240 
TAGTTGTCGC TAAAAACAAC GGTA7GCGC7 7GAGTGGGCG GATAAAAATA GTCGTCGCCT 300 

^r*. — i_ _ ^r-. _ _ o — 3 1 C 

[2] INFORMATION FOR SEQ ID NO: 31: 

SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 324 base pairs 
(3) 7Y?E: nucleotide 

(C) ST HANDEDNESS : single 

(D) TOPOLOGY: linear 

(n) MOLECULE TYRE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 
{3} STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 : 

AATTCAATCA ACGGAAAACA CATCAGCATC AAAAACAACG GTGGTAATGC CGACTTAAAA 60 

AACCTTAACG TCCATGCCAA AAGCGGGGCA TTGAACATTC ATTCCGACCG GGCATTGAGC 120 

ATAGAAAATA CCAAGCTGGA GTCTACCCAT AATACGCATC TTAATGCACA ACACGAGCGG 180 
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GTAACGCTCA ACCAAGTAGA TGCCTACGCA CACCGTCATC TAAGCATTAC CGGCAGCCAG 240 

AT I T GGC AAA ACGACAAACT GCCTTCTGCC AACAAGCTGG TGGCTAACGG TGTATTGGCA 300 

CTCAATGCGC GCTATTCCCA AATT 324 

(2) INFORMATION FOR SFQ II NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23C base pairs 




(3) STRAIN: Z2431 
(x:) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



AATTATGCAA AAAAACGCAA CGCCGAAAAA CTGGCACCGC GCGGATATTG TTGCTGCTTT 60 

GAAAAAGAAA GGCTGGTCAC TTCGAGCACT TTCAATAGAA GCGGGGTTGT CGCCGAATAC 120 

GCTTAGAAGC GCACTGGCCG CCCCTTATCT TAAGGGAGAA AGGATTATTG CCGCTGCAAT 180 

CGGAGTGGAA CCGGAAGAGA TTTGGTCCC-A ACGGTATGCA GAT CGG AATT 230 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 249 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A) ORGANISM: Neisseria meningitidis 
(3) STRAIN: Z2491 

(x:) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

TAATCG GCGGAATGCC TGTTCAACCG CACCAATCCC GCTGAATACG GTTGCTAATC 60 

AT 31 3 A ATCAGGTTTA A. G AAAA GT T T TAGATTTCCA ACCTTGTTGA C T GGGAAAGA 120 

A.TTTTT TTGTAATCGA GTATCGTGTG TCTGTGCCAT TGTCGAAATA GTCATACTTA 180 



249 



inecrma: 



:n ecr se; 



IT NS: 34: 



SEQUENCE CHARACTERISTICS: 
[k] LENGTH: 34 3 base pairs 
(3; TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGIN: 

(A; ORGANISM: Neisseria meningitidis 
(3) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

AATTCTTGTC CCGGAGTCCA ACGTATATTT ACCCTCCTGC GAGCTAAAAG ACTATTATTC 60 

TCCACTGCCA CAGTAGCCGC ATTCACCGCC GTATTCACAT CCCCTTTAAC CAATGCCACT 120 
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GCGCTGCCTG CGATAATCTG CGAGTAGGCT ATGACTTTTT GGCGTTCTTG GGGTGACAGT 180 

TTGCCTACAT CGCGTCCGTC CAACAGGGTT TCTCCCACCA TCTCGCCGAC TGCCGCGCCG 240 

ATTGCGCCGT CCCGACATTT GCCTTTATTT GCTACCGCCG ATGCACAGCC TGCTACGGCA 3 00 

TGGGCTATCT TGTGGGCAAI GTAGTCTTCG CTGAGATTAA ATT 34 3 
(2) INFORMATION FOR SEQ ID NO: 35: 




(A) ORGANISM: Neisseria mer.mgit idus 
(3) STRAIN: Z2491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

AATTCTTCAA ACATCGTTTC GA T AAT C GGG TCGGTGTACA CACTGATGCG GTCGCCCGCA 60 

CGGCTTTGAC CGGCTCGGAA AATATAGGCG GTGGCTTTGC CGTCGGCGAT GTCGACGCAC 120 

CAACGCCAGA TGGCGTCTTC GGTATTCAAA CAATCACCCG CACAGCTTTC ACCTGCGCGG 180 

AATT 1 84 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

TATGCTCAAT CTCATTTTCA AAATGCAAAA CTTTTCTGAT TTTTCCTACT TTTTGCTCAA 60 

TATTAGGAAG GTTTTAGGCA ATTGAAAATT TTTTGGCGCA TTTTTATGCG TCAAATTTCG 1 20 
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TTAACAGACT ATTTTTGCAA AGGTCTCCGT CTGTAAAAGC AAGGATAGGG CATCTGCCCT 180 

X77GA77GTT 7GA77AACGA TACAAGGAGT TTCAAAATGA GAGTTTTATA GTGGATTAAC 240 

AAAAACCAGT ACAGCGTTGC CTCGCCTTGC CGT ACTATTT GTACTGTCTG CGGCTTCGTC 300 

GCC77G7CC7 GA777AAA77 7AA7CCACTA 7A7G7G7TCA XGAAA7GAC7 7GGG7CGGAG 36 0 

GC7CAGG7AA 7GCGCAACAA AG77CA7A77 A77GCGAAAT 7TGCGAATC7 GCAGGGG77A 4 20 

ACGA7ACGGG AAA7CC7GA7 AAA7C777AG G A7 7 GCC AAA CAA7AGGT7G AG7AA7CGGC 4SC 

G7GG77GGGG AGC7ACAA7C GGAGC777AG CAGGTAGCCG CA7AGG7ATG CG7GAA777G 54 0 

G7AC3777GC GAGCCA7GCC A77GAAAA77 7CGAC7GG7C A7GG7A7CGA CG77A7A3GG 5CC 

AAA77GCC3A AACGA77GAA CGA3AA7A77 CAGGCGG777 GCCTTAATAG 77GAGGAGG7 55C 

CA7GA7G777 GCCAAACA7T A7CAA77CA7 CGCACTCGGC ATCATGCTGC 77C777A7A7 7 20 

G77GA77C7C 7A7ACGACCG A77777CCAA 7C7GACGTA7 TGGATGCTG7 77777A7CTG 78 0 
7777A77ACA GGAAAAA7A7 7AGC7CG777 G77AGAGAAA AGCTTTAAA7 AAAA7AGCAG 840 
CTAG7CGCAA AAGG7CG7C7 GAAACCTTTT CAGGCGGCCT TTCTAAAATA CATCCAACTT 900 
CC7AAICCC7 ATT7 I XCAAA AAGGAAATCT ATGCCCCATC TGCAAAACCT GTCTTTGGGC 960 

T7AAAGAAAA AGC7GCCTGT 7A7CC7GCAA ACAGAAATAT CAGAATGCGG CTTGGCATGT 1020 

CTGGCGGC7G TGGCGGGATT TCATGGTTTC CATACGAATT TACGCGCACT GCGTTCAAAA 1080 

TACTG7CCGA GACCT7TGCA AAAT7CCCCA AAATCCCCTA AATGTCTTGG TGGGAATTTT 1 1 40 

GGGGAA7TTT GCAAAGGTCT CATTC7A7AA CTGTAAATAC TTTTAAATTT ATGACAAAAT 1200 

AGTAAA7A77 GC7AAAA7AA TAT7GATG7C A7GAAATTTT TTCCTGCTCC ATGTCTGT7G 1260 
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GTTATCCTGG CTGTCATACC CCTTAAAACC TTAGCTGCCG AT GAAAACGA TGCAGAACTT 
ATCCGTTCCA TGCAGCGTCA GCAGCACATA GATGCTGAAT TGTTAACTGA TGCAAATGTC 
CGCTTCGAGC AACCATTGGA GAAGAAGAAT TATGTCCTGA GTGAAGATGA AACACCGTGT 
ACTCG3GCAA ATT AC AT TAG TTTAGATGAT AAGACGGCGC GCAAATTTTC TCCTCTTCCT 
TCC3TGCTCA T G AAAG AAA C AGCTTTTAAA ACTGGGATGT GTTTAGGTTC CAAT AAT TT G 
AGCAGGCCAC AAAAAGCCGC GCAACAGATA CTGATTGTGC GTGGCTACCC CACTT CCC AA 
GCTATTATCC AACCACAGAA CATG3ACTCG GGAATTCTGA AATTACGGGT ATCAGCAGGC 



. 3-~-_A G-"_~ — r; — ^_ ^ vjG A — GGGA AG-V,-G^^. o-_~ c^c;^. - — - - . 



~i^7_ w v__~ 



CTCA ACAACAAAT7 TCCCCTATAC AGGAAGAAAA TTCTCAATCT 7GGCGA- G_ A 



CA3CA333CT T G G AAAA C C 7 GCG7C3777G CCGAGTGTTA AAACAGATAT 7CAGA77ATA 
CCGTCCGAAG AAGAAGGCAA AAGCGATTTA CAGATCAAAT GGCAGCAGAA T AAA C C CAT A 
CGGTTCAGTA TCGGTATAGA TGA7GCGGGC GGCAAAACGA CCGGCAAATA TCAAGGAAAT 
GTCGCTTTAT CGTTCGATAA CCCTTTGGGC TTAAGCGATT TGTTTTATGT TTCATATGGA 
CGGGGTTTGG TGCACAAAAC GGAGT TGACT GATGCCACCG GTACGGAAAC TGAAAGCGGA 
TCCAGAAGTT ACAGGGTGGA T7ATTCGGTG CCCGTAAAAA AATGGCTGTT TTCTTTTAAT 
C AC AAT GG A C ATCGTTACCA CGAAGCAACC GAAGGCTATT CCGTCAATTA CGATTACAAC 
G3CAAACAAT AT C AGAGC AG CCTGGCCGCC GAGCGCATGC TTTGGCGTAA CAGGTTTCAT 
AAAACTTCAG T C GG AAT G AA ATTATGGACA CGCCAAACCT ATAAATACAT CGACGATGCC 
GAAATCGAAG TGCAACGCCG CCGCTCTGCA GGCTGGGAAG CCGAATTGCG CCACCGTGCT 
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TACCTCAACC GTTGGCAGCT TGACGGCAAG TTGTCTTACA AACGCGGGAC CGGCATGCGC 2460 

C AAA GT A T GC CCGCACCTGA AGAAAACGGC GGCGGTACTA TTCCAGGCAC ATCCCGTATG 2 520 

AAA A.7CATAA CCGCCGGA77 GGA7GCAGCG GCCCCGTTTA TGTTGGGCAA ACAGCAGTTT 2 580 

77C7ACGCAA CCGCCA7TCA A GG 7 C AACT GG AACAAAACGC C777GGT7GC CCAAGACAAG 2640 

77G7C7A.7CG GCAGCCGC7A CACCG77CGC GGATTTGATG GGGAGCAGAG TCTTTTCGGA 27 00 

GA.GCGA.GG77 7C7AC7GGCA GAA7AC777A ACTTGG7ATT 77CA7CCGAA CCA7CAGTTC 27 60 

7A7C7CGG7G CGGA.C7A7GG CCGCG7A7CT GGC GAAAGTG CACAATATG7 ATCGGGCAAG 2S20 

CA.GC7GA.7GG 37 33 A 33 337 C GGC 7 7 GAGA GGAGGGGA7A AAG7AGGCGG 7A7G777GC7 2880 

7A7GA7C7G7 77 333 333A3-. GCCC-C77CA7 AAACC CAAAG GC7TTCAGAC GACCAACACC 2940 

37 7 77. 3 3 337 73A„A377 3 A3-. 77A.3A.G777C TAACCTC7GA A7TTTTTTAC TGA7ATTTAG 30 CO 

A 3 3-37 37773 C77A7CC7CA GAC7G7CAAA CT77ACCTAC G7ACTTGGCG CGGAGTACGT 3060 

7GA7C77GAA AA7GGAA7AG .AG A7 GAAT AA AGGTTTACAT C GC AT TAT CT TTAGTAAAAA 3120 

GGAGAGGACC A7GG77GCAG TAGCCGAAAC TGCCAACAGC CAGGGCAAAG GTAAACAGGC 3180 

AGGGAG77CG G777C7G777 C AC7 GAAAAC TTCAGGCGAC CTTTGCGGCA AACTCAAAAC 3240 

CAGCCTTAAA ACCTTGGTCT GC7C77TGGT TTCCCTGAGT ATGGTATTGC CTGCCCATGC 3 300 

CCAAAT7ACC ACCGACAAA7 CAGCACCTAA AAACCAGCAG GTCGTTATCC TTAAAACCAA 3360 

CACTGGTGCC CCC7TGGTGA A7ATCCAAAC TCCGAATGGA CGCGGATTGA GCCACAACCG 34 20 

CTATACGCAG TTTGATGT7G ACAACAAAGG GGCAGTGTTA AACAACGACC GTAACAATAA 3480 

7CCGTTTCTG GT C AAA GGG A GTGCGCAATT GATTTTGAAC GAGGTACGCG GTACGGCTAG 3540 
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CAAACTCAAC GGCATCGTTA CCGTAGGCGG TCAAAAGGCC GACGTGATTA TTGCCAACCC 3600 

CAACSGCATT ACCGTTAATG GGGGGGGGZT TAAAAATGTC GGTCGGGGCA TCTTAACTAT 366 0 

CGGTGCGCCC CAAATCGGCA AAGACGG7GC ACTGACAGGA TTTGATGTGC GTCAAGGCAC 37 2 C 

A77GACCGTA GGAGCAGCAG G77GGAA7GA TAAAGGCGGA GCCGACTACA CCGGGGTACT 3780 

7GC7CG7GCA GJTGCZZZGG AC-GGGAAA77 AGAGGGTAAA AACCT GGCGG 777C7ACCGG 3S4C 

7 CCGC AG AAA G7AGA77AGG CGAGCGGCGA AATCAGTGCA GGTACGGCAG CGGG7ACGAA 39C0 

AGO GAC7 A77 GCCGTTGATA C7GCCGCAC7 GGGCGGTATG 7ACGCCGACA GCATCACACT 396C 

GA77GCCAA7 G AAAAA G G G G Z AGGCG7 CAA AAA 7 GGCGGC ACACTCGAAG CGGGCAAGCA 4C2C 

A77GA77G7G AC77GG7GAG GCCGCA77GA AAAGAGCGGC CGCATCGCGA CCAG7GCCGA 4C£C 

GGGGACCGAA GGT7CAGGGA C77A7C7G7C CA7CGAAACC AGCGAAAAAG GAGCGGCAGG 41 4C 

GAGA777A7C TGCAA7GG7G G7CGGA7CGA GAGCAAAGGC TTATTGGT7A T7GAGACGGG 4 2CC 

AGAAGA7A7C AGCTTGCGTA ACGGAGCCG7 GGTGCAGAAT AACGGCAGTC GCCCAGCTAC 4 260 

GACGGTA77A AATGC7GGTC A7AA777GGT GATTGAGAGT AAAACTAATG TGAACAATGC 43 2 0 

CAAAGGCTCG GCTAATCTGT CGGCCGGCGG TCGTACTACG ATCAATGATG CTACTATTCA 43S0 

A GGGGGC A G7 TCCG7G7ACA GC7CGACCAA AGGCGATACT GAATT GGGTG AAAATACCCG 44 4 0 

7AT7ATTGCT GAAAACGTAA CCGTATTATC TAACGGTAGT ATTGGCAGTG CTGCTGTAAT 4 500 

7 GAGGCT AAA GACACTGCAC ACATTGAATC GGGCAAACCG CTTTCTTTAG AAACCTCGAC 4560 

CGTTGCCTCC AACATCCGTT TGAACAACGG TAACATTAAA GGCGGAAAGC AGCTTGCTTT 4620 

ACTGGCAGAC GATAACATTA CTGCCAAAAC TACCAATCTG AATACTCCCG GCAATCTGTA 4680 
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TGTTCATACA GGTAAAGATC TGAATTTGAA TGTTGATAAA GATTTGTCTG CCGCCAGCAT 
CCATTTGAAA 7GGGA7AACG CTGCCCATAT TACCGGCACC AGTAAAACCC TCACTGCCTC 
AAAAGACATG GG7G7GGAGG CAGGCTTGCT GAATGTTACC AATACCAATC TGCGTACCAA 
G733337AA7 C733A3A77C A GGC A GCCAA AGGCAA7A77 CAGCXTCGCA A7ACCAAGC7 
GAACGCAGCC AAGG37C7CG AAAGCACCGG AT 7 GCAGGGC AA7ATCGTTT CAGACGGCC7 
73A7337377 737G3AGACG G7CA7G7A7C C77AT7GGCC AACGG7AA7G CCGAC777AC 
CGG7C AC AA7 ACCC7GACAG 3CAAGGCCGA TGTCAA7GCA GGATCGGTTG G7AAAGGCCG 

CAACGG7A77 CA33773G7G AC33AAAACA ACGCAA77CA A7CAACGGAA AACACA7 CA3 
CA7CAAAAAC 3.A3337 337A A73333A377 AAAAAA C C 7 7 AACG7CCA7G C C AAAAG 3 GG 
GGCA77GAAC A77CA77CCG AC CGGGC A7 7 GAGCA7AGAA AATACCAAGC 7GGAG7C7A3 
CCA7AATACG CA7C77AA7G CACAACACGA GCGGGTAACG CTCAACCAAG TAGATGCC7A 
CGC ACACCG7 CA7CTAAGCA 77ACCGGCAG CCAGATTTGG C AAAAC G AC A AACTGCCTTC 
7GCCAACAAG C7GG7GGC7A ACGG7GTA7T GGCACTCAAT GCGCGCTATT CCCAAATTGC 
CGACAACACC ACGCTGAGAG CGGG7GCAAT CAACCTTACT GCCGGTACCG CCCTAGTCAA 
GCGCGGCAAC A7CAA77GGA G7ACCG77TC GACCAAGACT TTGGAAGATA ATGCCGAA7T 
AAAACCATTG GCCGGACGGC 7GAATATTGA AGCAGGTAGC GGCACATTAA CCATCGAACC 
TGCCAACCGC ATCAGTGCGC ATACCGACCT GAGCATCAAA ACAGGCGGAA AATTGCTGTT 
G7CTGCAAAA GGAGGAAATG CAGG7GCGCC TAGTGCTCAA GTTTCCTCAT TGGAAGCAAA 
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AGGCAATATC CGTCTGGTTA CAGGAGAAAC 
CAAAAACTTG GTTGTCGCCA CCACCAAAGG 
AITCAGCAAT TATTTTCCTA C A C AAAAA GC 




GA C C GC A C AA GG C A _ CC AAA _ CAo^bG 

^_ _ A w C v_ o- w.r-GGC ^ _„*-„_ » ^ ^.r v^ro-w 

_ ^r_— . ^ vj: w . r* ~ ~- ^ w-TL v_, _ ~^TL _ _ — — w 

AGCCGCTCCG AACAAGCCTT CACGCTTGAC 

_ ^CG-cCrC - C GA_ oA. GCn^ _ A — 

CAGCACAGAC ATCAAAGCCC AT ACT GAT AT 
TACCTTCTTA AAAACCAAAG GTAAAAGCGG 
CACCCGCGAC CACCTGATTA TGCCAGCCCC 
TC AGGCAGGC GGCAACATCG AA GC T AA TAG 
TACCCTGGTT GCGGGTGAAG AGCTGCAACT 
G7TGGATGTC CAAAAAAGCC GCCGCTTTAT 
TAAAAACGAA CTGAACGAAA CCAAATTGCC 
CCGTTCAGGC TGGGATACCG TGCTCGAAGG 



AGATTTAAGA GGTTCTAAAA TTACAGCCGG 
CAAGTTGAAT ATCGAAGCCG TAAACAACTC 
GGCTGAACTC AACCAAAAAT CCAAAGAATT 
CTCGCCTAAA AGCAAGCTGA TTCCAACCCT 
TATTCAAGCC AT CAACAAGG AA GT T AAA GG 
A GC C AAGCT T T C T GC AC AAA. AT AT .■. GAC T x 
TTCCGATATT ACCGCTTCCA AAAAACTGAA 
AGCAGATTCA GAC-GCGGCTG CTATTCTGAT 
CAAGCCCACC TACAAGAGTC ACT AC GAC AA 
CGGACGTACG GGGGIAAGTA TTCATGCAGC 
CGGTGCAXCC G AAAT C AAA G CICCCTCAGG 
TGTACTGGAG GCTGGACAAA ACGATGCCTA 
CAAAATCATC AGAAAAACCA AGTTTACCAG 
CGTCGAGCTG ACCGCCAACG GTATCACGCT 
CACCCGCTTC AATGCCCCTG CAGGTAAAGT 
GCTGGCAGAA GAAGGCATCC ACAAGCACGA 
CGGCATCAAG GTAGGTAAGA GCAATTACAG 
TGTCCGCGTC GTCGCCCAAA CTGCAGCCAC 
TACCGAATTC AAAACCACGC TGGCCGGTGC 
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CGACATTCAG GCAGGTGTAG GCGAAAAAGC CCGTGTCGAT GCGAAAATTA TCCTCAAAGG 
CA7XGCGAAC CGTATCCAGT C GG AAGAAAA ATTAGAAACC AACTCAACCG TATGGCAGAA 
ACAGGCCGGA CGCGGCAGCA CTATCGAAAC GCTAAAACTG CCCAGCTTCG AAAGCCCTAC 



TTGTCCGCAC CCGGCGGCCA CA7CGTCGAC ATICCGAAAG GCAATCTGAA 



X CGAGTATGCC TATGTGAAAC AGCTCCAAG7 



ATCAGGTGCA GCCTGCTTAC GACAGAT GGG ACT AC AAA C 



GGAGGGCCTA A C C G AA> GC A G GCGCGGCGAT "A3 C GC AC TG GCCGTTACCG TGGTCACCTC 



— • w -j: ~_ ".www O- - . w w w -^r - ~- 



AAACGG7GCG GCCGCCGCCG CAACCGA_G: 



:C CGTATCGTTC AT CAACAACA AAGGCGATG: 



C C T G AAA G A G C 7GGGCAGAAG CAGCACGGTG AAAAATCTGG TGG.-GCCGC 
AGCCG AC AAAA.7 C GG CGCTTCGGCA CTGAACAATG TCAGCGATAA 



- ~, w w w ^ n -w-vjj ^, o- 



GCAG7GGA7C AACAACCTGA CCGTCAACC7 AGCCAATGCG GGCAGTGCCG CACTGATTAA 
TACCGCTGTC AJ.CGGCGGCA GCCTGAAAGA CAATCTGGAA GCGAATATCC TTGCGGCTTT 
GGTCAATACC GCGCATGGAG AAGCAGCCAG TAAAATCAAA CAGTTGGATC AGCACTACAT 
AGTCCACAAG ATT GCCCATG CCATAGCGGG CTGTGCGGCA GCGGCGGCGA ATAAGGGCAA 
GTGTCAGGAT GGTGCGATAG GTGCGGC7GT GGGCGAGATA GTCGGGGAGG CTTTGACAAA 
CGGCAAAAAT CCTGACACTT TGACAGCTAA AGAACGCGAA CAGATTTTGG CATACAGCAA 
ACTGGTTGCC GGTACGGTAA GCGGTGTGGT CGGCGGCGAT GTAAATGCGG CGGCGAATGC 
GGCTGAGGTA GCGGT GAAAA ATAATCAGCT TAGCGACAAA GAGGGTAGAG AATTTGATAA 
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CGAAATGACT GCATGCGCCA AACAGAATAA TCCTCAACTG TGCAGAAAAA AT ACT GT AAA 8160 

AAA GT A 7 C AA AATGTTGCTG ATAAAAGACT TGCTGCTTCG ATTGCAATAT GTACGGATAT 8220 

A7CCCGTAGT ACTGAATG7A GAACAA7CAG AAAACAACAT TTGATCGATA GTAGAAGCCT 8 280 

7CA77CA7C7 IGGGAAGCAG G7 C 7 AA 7 7 GG 7AAAGA7GAX GAA7GGTA7A AATTA77CAG 8340 

CAAA7C77AC ACCCAAGCAG A777GGC777 ACAG7CT7AT CATTTGAATA CTGC7GC7AA 84 CO 

A7C77GGC77 CAA7CGGGCA A7ACAAAGCC 777ATCCGAA 7GGA7GTCCG ACCAAGG77A 8460 

7ACACX7A77 7CAGGAG77A A7CC7AGA77 CA77CCAATA CCAAGAGGGT TTGTAAAACA 65 20 

AAA7ACACC7 A77AC7AA7G 7CAAA7ACCC GGAAGGCA7C AG7T7CGA7A CAAACC7AAA 8 5S0 

AAGACA7C7G GCAAA7C-C7G A7GG7777AG 7 CAAGAACAG GGCATTAAAG GAGCCCA7AA 564C 

CCGCACCAA7 777A7GGCAG AAC7AAA77C ACGAGGAGGA CGCG7AAAAT C 7 G AAA CCCA 87 CO 

AAC7GA7A77 GAAGGCA77A CCCGAA77AA A7A7GAGATT CCTACACTAG ACAGGACAGG 87 6G 

7AAACC7GA7 GGTGGA77TA AGGAAA777C AAGTATAAAA ACTGTTTATA ATCCTAAAAA 8820 

A7777CTGA7 GA7AAAA7AC 77CAAA7GGC 7CAAAATGCT GCTTCACAAG GAT ATT C AAA 888 0 

AGCCTCTAAA ATTGCTCAAA AT GAAAGAAC TAAATCAATA TCGGAAAGAA AAAATGTCAT 8940 

TCAATTCTCA GAAACCTTTG ACGGAATCAA ATTTAGATCA TATTTTGATG TAAATACAGG 900 0 

AAGAATTACA AACATTCACC CAGAA7AA7T TAAAGGAAAA ATTATGAAAA ATAATATTTT 9060 

7C7AAACTTA AATAAAAAAT CTATAAA7AA CAACCATTTT GTTATTTCGA TTTTTTTTGA 9120 

AACAATTTAC CAATTTGAAA CTAAAGA7AC GCTTTTAGAG TGTTTTAAAA ATATTACAAC 9180 

TACCGGACA7 TTTGGAGTAA TAGG7GC7CA ATATGAAAAA ATAGATGCTA CCAGATGGAT 9240 
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TGGAGATTAT GAAGAGGTAA ATGGATTTGA GTATATTGAT AAAGCTCCTT CTATTTATTT 9300 

77CAG7TGGA GATGATTTCA ATCCTGAAGA ATTAATTATA CCTATTAATT TAGCATATCA 9360 

77ACTTTAAT ATTGCAATAT CTGATTTCTT AATAGCTCAC CCTGAATATC AAAAAAAGTG 94 20 

TAJ^-AGAAATA CAAAAAACAT AZ 7C 7 CAAAC AAACTG7AGC C X GC AT G AAA CCTAAAATGC 94 8 C 

A7GGG7AAGG TG7G7GG77C AGGAGGGAGG CGTTCCATGA 777ACGGCTC AATGCCGTCT 954 0 

GAAAAGC7GA CAA777T7GA GACGGCA777 G77ATGCAAG 7AAA7A7TCA GA77CCC7A7 96 CC 

A7AC7GCCCA GACGCG7GGG 7GG7GAAGAG AGGCCCTACG CT7GCTGCAG AAC7TTGGGG 966 C 

7AAAAGGGG7 G7GAGGA77A GGGGAGGG7A 7GGGAA7GAG AAGAG7GGGA 7GG7GG7GAG 972C 

CAGGAGGGA7 A7GAG77GGG AAAAC GGG AA AA7CAAAA7T GAA7G77AGG G7GACCAA7A 97S2 

77AG7A7GGG AGAGAGAGGG AAC7G7A7AG G777GAACGG CGGAGG7ACA AAAC7GGGAA 9640 

A7GG7AGAAC G GC AAAC A G A 7 7 A, G C G AA G 7 C AAAGAAC A C AAAAAC GCC A AGCCCGAGGC 9900 

A GT AAA C C 7 C AGCGCA7CCC AAGGCATCGA CATCAAATCT GGTGGCAGCA TCGACGCC7A 9960 

CGCCACCGCA TTCGATGCCC CCAAAGGCAG CATTAACATC GAAGCCGGGC GGAAATTGAC 100 20 

ACT CT AT GCC GTAGAAGAGC TCAACTACGA CAAACTAGAC AGCCAAAAAA GGCGCAGATT 1 0080 

TC7CGGCATC AGCTACAGCA AAGCACACGA CACCACCACC CAAGTCATGA AAACCGCGCT 10140 

GCCC7CAAGG GTAGTTGCAG AATCAGCCAA CCTCCAATCG GGCTGGGATA CCAAACTGCA 10 200 

AGGCACACAG TTTGAAACCA CACTGGGTGG CGCAACCATA CGCGCAGGCG TAGGTGAGCA 10260 

GGCACGGGCA GATGCCAAGA TTATCC7CGA AGGGATCAAA AGCAGCATCC ACACAGAAAC 10 320 

CG7GAGCAGC AGCAAATCTA CTCTATGGCA AAAACAGGCA GGACGGGGCA GTAACATCGA 10 380 
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AACCTTGCAA TTGCCGAGTT TCACCGGTCC CGTTGCGCCC GTACTGTCCG CACCCGGCGG 
77ACA77G7C GACATTCCGA AAGGCAA7CT GAAAACCCAA ATCGAAACCC TCACCAAGCA 
GCCCGAGTAT GCTTATTTSA AACAACT7CA AGTTGCGAAA AACATCAACT GGAATCAGGT 
C-CAGC77GC7 7ACGA7AAA7 GGGAC 7 AC AA ACAGGAGGGG A7 GACACCCG CAGGAGCAGC 



^v;o.---.v,vc^v ccCc^cc-.-c v^Cv^^-- vjrvjrr-.vjr^ GoGA.GGA.GCC G ^ A G C A G ^ r-^A C\jGCAo^CGc 
AAC7GGAG7A GCAGCAGGAA CGGCAGCCAC AACCGGAGTA GCAGCAGGCA CA7CAGC 7 GC 



C C C _ ^ A _ C A_— . w.-_r-* _ AAA o G A w AC A _ -^nn CCA7ACCC7G AAAGAAC7GG GCAAAAGCAG 

CACCG7CAGA CAGGCCGCCA CCGCCGCCG7 AACCGCAGGC G7AC7GCAGG GCA7AAGCGG 
GC7 GAAC AC C CAAGCAGCCG AAGCCG7 CAG CAAACA7T77 CACAG7CCCG CAGCAGGCAA 
AC7GACCGC7 AACC7GA7CA ACAGCACCGC 7GCCGCAAGT GTCCATACCG CCATCAAC GG 
CGGCAGCC7G AAAGACAAC7 7GGGCGA7GC CGCACTGGGT GCGATAGTCA GTACCGTACA 
CGGAGAAGTA GCGAGC AAAA 7CAAA777AA TCTCAGCGAA GACTACATTG CCCACAAGAT 
AGCCCA7GCC G7AGCAGGC7 G7GCA7CGGC GGTAGCAAAT AAAGGCAAAT GTCGGGACGG 
CGCAA7CGGC GCGGCAG7CG GCGAGA7GGT GGGAGAAACC CTGTTGGACG GACGCGATGT 
A GGC AAAC7G 7CACCCCAAG AACGCCAAAA AGTCATAGCC TACTCGCAGA TTATCGCAGG 
CAGCGCAG7G GCA77GG77A AAGGGG AT GT GAATACGGCG GTGAATGCGG CTACTGTGGC 
AG7GGAGAA7 AA7AG7C777 7AGC7CGCAG GAGGGTAAAT ATACGTTGGA C7CCGCGACA 
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AGAAITGGAA CATGAATATG CCATTCTTGA 

GCTGGA7CCG AAAI7TAACG GGA77GCTAT 

A CAT GAT GT A CAAACATACA GGCAATATTA 

_ o — ^7--_A G A. A A. jl A.GAA. - Cn GGATAAA 

A _ G AAJAA _ A.C C G A T AA7 G A-GC _ TTATGT 

_ ^ ^ _ .*-. . - _ v_ G _ ~. _ GA* _ C C _ vj; ■w ^nGAAiG 

— ^ - CAAA- _ G CAA.GAAAG _ A_ CGGGGA*T7GG 

— v r^r_.-_s__ ^ _ G C-T 1 - - >r ^_ AA. vj: 

_ ^r--_~. ~ <: G A. G A--i G GG_GG 

AA. GG wA G .-.A ,j A_A G AAA A. _ G A _ GA _ 

A 777777A77G 7GAGCAG7GG 
TTCTCTGAGA AA G AA. GCZGA GSAAAAACAT 
GG77CGGCGA C AC AACCT GA, ATATGTAATT 
GT AAA7 77 7 T TCGATAAATT TGGAAGGGAT 
AA77CGAA77 ATTATTTTCT TTCTATGGCT 
GAT TACT TAG C AG AAGG AT A TACTTATTTC 
TTGAAACAAG ATTTTATTAA TAATGAGCGA 
GATAAGGTAA TTCTATTTCC AAAGTTTGGT 
ATT7AA77AA GT77TAAGGC CGTCTGAAAA 



81 

AATCCAGGCC ATTACCAATC AAATCCGAAG 
TCTGAGGACT CCTGGAGAGC CGTGGACAAG 
TAATCAATTA AGGGAATCCA GAGGCTTTGC 
CAACGGCAAT GAATTTAACC GTATCATGTC 
AGCCAATCCT AAATCGGCGA CGGGGTATTT 

G G AAA. x T A — ^ G AA. GAAAA _ A GC GA*A _ i 

TTATATCAAG GAGGCTGTTA GAAAATATAG 
_ A. v_ A C C _ A >_ _ A. A _ AA. ^ A. G o j-iA Gj--A~A G G - 
7CAGGTCAAT CGAATTCGAG AA7CTGTA.Tr 
7AGAGATACA ACAGGAAGGA TTTACAAATG 
TCTTATGGTT ATAAGAGACT TCA7AAGGC7 
C7TAAAGGGG AG7TA7ATAC TGCCGTAATA 
ACCTTGCGAG AGGAAGTAGG TTTTT7TTCG 
TATTTAACCC ATCAATTTCA AAAATATTCC 
GTATGGAGAG ATTATATAAC TTTGGAATCT 
TTCAATGAAA ATACGGATGA TTGCTATGTT 
TATGAAAAAA CAGAATTATA TTCCCAAAAA 
GAATATGATT TGGTGTTAAA TCCGGACATT 
AAATTTCAAA CGGCTTTTAT TATTGGGTTT 
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GGAATCTGAG GATAAAGCTG ATAAAAACCA 
77GTACAGAC TAAAGGCAGC AATCAAATCA 
A7ATGGAA7A AICATGAATA A GAGAAT GAA 



AG7A7GGC7C AAA GG7 A7 GA A7A7A7777A 




GG7GAGGG77 7 GG7 7 AAGAA 7AC7GA777C 
GG7AAAGGGA AGA77AC7GC C7 A77 CAAAA 
GGGGGGGA7G 7GAA7AGAGC GGGGAA7GCG 
XATCC7AGAT GCGTTGGTGC AAAG7GTGAT 
CGTGAAAATC CTGAAGAA7A TCGAGAAGTT 
A7CGG7GATA TACAGAGTTT TGTACAAGCA 
CTGGGTGTGG TTCCGGGTAT CGG7GAA7CG 
AAAAA7 x T AC AAGGCATGAA AAAAGCCTTG 
GGCTATGTCA GCAAAACCAA AA7 C AAAAT C 



82 

GGAAATTATC AGATTGCTAT ATACGTATTG 127 20 

CTATTGCTTA CCCACAAAAA TAAATTGATT 12780 

AATGTGTCCT GCTTGTCAAC AAGGCTATCT 1284 0 

7GAAA77A77 C7G7G7GA7G AA7GCGA7GC 12900 

TGGAGAATA7 GAAAAAGATT TT7A7TC77A 12960 

GAGTGAA7G7 A777GGGAAG GAGA777G77 13020 

AAATGATATG GA7TGATGGA AA7777AAGC 13080 

G7AA7CA7AC GCAAGAT7A7 CAACAGAGAG 13140 

GCCCA7GCCA 7AGCGGGC7G 7GCGGCAGCG 13200 

GCGATAGGCG CTGCAGTCGG 7GAGA77G77 13 260 

AGTCGTATGA G7GCGACCGA AA7CGAAAAA 13320 

CTGGTTGCCG GCACTGCG7C TGCCG77GTA 13380 

GCACAGATAG CGGTGGAGAA TAA7AC7TTG 1344 0 

GAATTTCAAA AGGAACAACA AAAA7GGATA 13500 

TTGCTTTTTC AGACAGGA7T TATTCGAATT 13560 

CAGACCGCTG CCGATCACCT GTTTGCTTTG 13620 

ATACAGGCCT ATAAAGTAGC GAAAGCGGCA 13680 

GACAAGGCAG CAACCGTTGC CACTGCACAG 13740 

GGTCAAACTG AATTAAGGGT TACTGCAGCA 13800 
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ACTGACAAAC AATTGCTGAA AGCTATTGGC 



GAGCA.GCTAT TTGACTCTT7 AGCT AAAC AA 



TACGGGGGAA ATAACGGTTT TGATCATGTA 



Gor.. ~. _ ~- G A_-— ~ _ oA G - A. o G — o — 



G C AG ^ C G G A „ CG_ C AAA. G A GGT i-.A.GGC G 




5 ..^ A- A — — _ vj: GG*-rv_r._ As- G 

_ A. GjT. A - w _ -~ - vr _ G AAr-- sj: — - _r---r-- c^v-c >_ 

If 1 

S- ACG- iA^AGG . c^Ao-^i _CC C A _ G — C _ A _ C 

Q 
4= 

?y AAC AT GCTT A TGTTGCAGGA AAGGTGGGGG 
PJ 

yp AACCCTTCTT CCTTCCCTGT GACATGCTCA 




TGAGCGACAG CCCACAGCTG CGTGAGTTTC 



ATACAGAAGG CCTTATAAAC CGCTACGACC 

TGATGGTGGA G GGT AAGCAG CTTGATCGGT 

ATCCGACCCC T AGG AAA T GG CTGCAAAAGC 

TCGGCGAACA GGATGCCGAG GCAATGAAAG 

GGGCGGGTAT GGCTGCCAAA GAAAGATTGT 
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GAAGGAAGGG ACACGACAGG TAAAATGACC 13860 

AATGGCTTCA GAGTGCTTTC GGGCGGC AAA 13920 

TGGCAGGCTG CCGATGGTAG TGTCGTTTTG 13980 

As-^G j. ACAGC iGAA^GCGAA. i Gvj_ vj^Gco- i4^4J 

AGAGAAGTTT 7AGATCAA77 ACCCGACGGC 141 CC 

AATAAGAAGG GC A C A T T AAA AACAGCAA7A 1 4 1 €C 

GTTATTCTTC CTGICAAAGT TCGTTGTAAA 1422C 

ATATGATGAC G A G G C AAAJLA CGGTATGAGC ' 4 25 2 

CCAATTTCAA TAGGGGTTGG CTTGACCCTA 1 4 342 

GGGTTGGAGC T AT GG AAC AT ATTTTGATGG 1 4400 

TATTTGAACA TGATCTTAAG AAATTGAAGC 1 4460 

TTTTGCTGAG TGTAAATTCT ACAGAGCGTG 14520 

AC ATT CAAAA TCCGATGTTT CTGATGCTGA 1 4530 

TGGTGCGCAA TATCGACAAC AT C GC C AAC G 14640 

TCAACCGGCA TATGATTTAC AATACTCCGC 14700 

T GAAACAACG TAGCGAGAAA GTCTTGGC GC 14760 

GGTTGTACGA TTACCGCTTC TTCCTCGCTT 14820 

CCGCCTTAGA GCCGCTTTTC GATAAAAAAA 14880 

CCTATTTCGA TTTCTACCTG CAGCCGCAAA 14940 
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TCGTTACCTA CGCCAAAATC GCATCCATGC ACGGTTTCGA TTTGGGCATA GATCAAGAAA 15000 

TCTCACCGAG GGATTTGATT GTTTACGATC CGCTGCCGGC AGACGAATAT CAAGACATCT 15060 

TCGATTTTAT GAAACAGTAT GAC77GTCTT ACCCGTATGA ATATCTGCAG GATTGGATAG 1 51 20 

A77AG7A7AC G77CAAAACC GA7AAGC7GG 7A7ITGGTAA CGCGAAGCGA GAGTGAGCCG 151SC 

-AAAAC7C7G AGC7CC7G77 77A7AGA7TA CAACTTTAGG CCGTCTTAAA GCTGAAAGAT 1 524C 

tttcgaaagc 7 A7 aaagt 7 ga agccczzcca cagtacatag atctgigttg tggcggggct 153CC 

77ACCACGG7 GA77GCCGGA GAAGAAC7CA ACCTGCTGGC AAAACAAGGC ATGAGATCTT 1536C 

7GGAA7AAGA 7 G A G7 7 G A G A CC777GGAAA AAAGCCC77G CGGGAGA7CG GAAA G C C AAA 1 542Z 

CAGAGGA777 GGGG7G7777 G G7 AG G AAA7 AGG7GG7AA7 TT7ACCCAAA TACCCCC7TA 1 54 5 Z 

A7GC7GC7GG GA G A G G G GA 7 AA7GAG-GGA7 GCGGGCXGCC 7777AGGCGG CAGCGGGGGG 1 554C 

A777AGGG7G 77GGGGGG77 7GAAGAGG77 CAAACACATC GGC77CAGG7 GGCTTTGCGC 156CC 

AG7GAC777G TCA777CCAA 1562C 

[2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 580 amino acids 
(5) TYPE: amine acid 

(C) STRANDZDNZSS : Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(ix) FEATURE: 

(A) NAME /KEY: Protein 

(B) LOCATION: 1 . .530 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Met Lys Phe Phe Pro Ala Pro Cys Leu Leu Val lie Leu Ala Val He 
1 5 10 15 

Pro Leu Lys Thr Leu Ala Ala Asp Glu Asn Asp Ala Glu Leu He Arg 
20 25 30 

Ser Met Gin Arg Gin Gin His He Asp Ala Glu Leu Leu Thr Asc Ala 
35 40 45 

Asn Val Arg Phe Glu Gin Pre Leu Glu Lys Asn Asn Tyr Val Leu Ser 
5C 55 60 

Glu Asp Glu Thr Pre Cys Thr Arg Val Asn Tyr He Ser Leu Asp Asp 
CD to 75 SO 

Lys Thr Ala Arg Lys Phe Ser Phe Leu Pro Ser Val Leu Met Lys Glu 
S5 90 95 



Thr Ala Phe i.ys Thr Gly Met Cys Leu Gly Ser Asn Asn Leu 
TOO 105 110 



=er A: 



Leu Gin Lys Ala Ala Gin Gin He Leu He Val Arg Gly Tyr Leu Thr 

120 125 

Ser Gin Ala He He Gin Pro Gin Asn Met Asp Ser Gly He Leu Lys 
1 30 135 140 

Leu Arg Val Ser Ala Gly Glu He Gly Asp He Arg Tyr Glu Glu Lys 
145 150 155 160 

Arg Asp Gly Lys Ser Ala Glu Gly Ser He Ser Ala Phe Asn Asn Lys 
165 170 175 

Phe Pro Leu Tyr Arg Asn Lys He Leu Asn Leu Arg Asp Val Glu Gin 
180 185 190 
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Gly Leu Glu Asn Leu Arg Arg Leu Pro Ser Val Lys Thr Asp He Gin 
195 200 205 

lie lie Pre Ser Glu Glu Glu Gly Lys Ser Asp Leu Gin He Lys Trp 
210 215 220 

Sir. Gin Asn lys Pre He Arg Phe Ser He Gly He Asp Asp Ala Gly 
225 230 235 240 

31 V -V s ~ h - ~«~- Gly Lys Lyr Gin Gly Asn Val Ala Leu Ser Phe Asp 
245 250 255 

Asn Pro Leu Gly Leu Ser Asp Leu Phe Tyr Val Ser Tyr Gly Arg Gly 
250 265 270 

Leu Val His Lys Lhr Asp Leu Thr Asp Ala Thr Gly Thr Glu Thr Glu 
2" 5 280 2S5 



Ser Val His Tyr Ser Val P: 
295 300 



\a_ -ys lys 



Phe Asn His Asn Gly His Arg Tyr Hi 
310 315 



s Glu Ala Thr 
320 



Glu G -Y Tyr Ser Val Asn Tyr Asp Tyr Asn Gly Lys Gin Tyr Gin Ser 
325 330 335 

Ser Leu Ala Ala Glu Arg Met Leu Trp Arg Asn Arg Phe His Lys Thr 
340 345 350 

Ser Val Gly Met Lys Leu Trp Thr Arg Gin Thr Tyr Lys Tyr He Asp 
355 360 365 

Asp Ala Glu He Glu Val Gin Arg Arg Arg Ser Ala Gly Trp Glu Ala 
370 375 380 



Glu Leu Arg His Arg Ala Tyr Leu Asn Arg Trp Gin Leu Asp Gly Lys 
385 390 395 400 
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Tyr Lys Arg Giy Thr Gly Met Arg Gin Ser Met Pro Ala Pro 
405 410 415 

Asn Gly Gly Gly Thr He Pro Gly Thr Ser Arg Met Lys He 
420 425 430 

Ala Gly leu Asp Ala Ala Ala Pro Phe Met leu Gly Lys Gin 
455 440 445 

Phe lyr Ala Thr Ala He Gin Ala Gin Trp Asn lys Thr Pro 
455 461 

Ala Gin Asp Lys Leu Ser He Gly Ser Arg Tyr Thr Val Arg 
4"C 475 4SC 

Asp G_y Glu Gin Ser Leu Phe Gly Glu Arg Gly Phe Tyr Trp 
455 490 4S5 

Thr Leu Thr Trp Tyr Phe His Pro Asn His Gin Phe Tyr Leu 
3CC 505 510 



GLy Ala Asp ,yr Gly Arg Val Ser Gly Glu Ser Ala Gin Tyr Vai Ser 
515 520 525 

Gly Lys Gin Leu Met Gly Ala Val Val Gly Phe Arg Gly Giy His Lys 
53C 535 540 

Val Gly Gly v S r phe Ala Tyr Asp Leu Phe Ala Giy Lys Pro Leu His 
545 550 555 560 

Lys Pre Lys Gly Phe Gin Thr Thr Asn Thr Val Tyr Gly Phe Asn Leu 
565 570 575 

Asn Tyr Ser Phe 
580 



(2) ^NrGRVATHX FOR SEQ ID NO: 38: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1981 amino acids 
(3) TYPE: aou.no acid 

(C) STRAKDEDNESS: single 

(D) TOPOLOGY: linear 

:::) MOLECULE TYPE: peptide 

' A ; X AME /KEY : Peptide 
[3] LOCATION: 1 . .1931 

(xi) SEQ'S^CS LESCRIPTICX: SEQ IT NO : 33 : 

Mer Asr. Lys GLy Leu His Arg He He ?he Ser Lys Lys His Ser Thr 
3 1 C 15 

Met Va^ Ala Val Ala Glu Thr Ala Asn Ser Gin Gly Lys Gly Lys Gin 
20 25 30 

Ala Gly Ser Ser Val Ser Val Ser Leu Lys Thr Ser Gly Asp Leu Cys 
35 40 45 

Gly Lys Leu Lys Thr Thr Leu Lys Thr Leu Val Cys Ser Leu Val Ser 
50 55 60 

Leu Ser Met Val Leu Pro Ala His Ala Gin lie Thr Thr Asp Lys Ser 
65 70 75 80 

Ala Pro Lys Asn Gin Gin Val Val He Leu Lys Thr Asn Thr Gly Ala 
85 90 95 

Pro Leu Val Asn He Gin Thr Pro Asn Gly Arg Gly Leu Ser His Asn 
100 105 110 



REPLACEMENT SHEET (RULE 26) 



89 



Arg Tyr Thr Gin Phe Asp Val Asp Asn Lys Gly Ala Val Leu Asn Asn 

120 125 

Asp Arg Asn Asn Asn Pro Phe Leu Val Lys Gly Ser Ala Gin Leu He 
130 135 140 



145 



Val Arg Gly Thr Ala Ser Lys Leu Asn Gly He Val Thr 
15C 155 160 



Val Gly Gly Gin Lys 
165 



Vai He He Ala Asn Pre Asn Gly He 
170 175 



_hr Val Asn Gly Gly Gly Phe Lys Asn Val Gly Arg Gly He Leu Thr 
180 1S5 190 



Ly Ala 
1 55 



Le Gly Lys Asp Gly Ala Leu Thr Gly Phe Asp 
2CC 205 



21 ■ 



_hr Val Gly Ala Ala Gly Trp Asn Asp Lys 
215 220 



G_y Gly Ala Asp _yr ihr Gly Val Leu Ala Arg Ala Val Ala Leu Gin 
225 230 235 240 

GLy Lys Leu Gin Gly Lys Asn Leu Ala Val Ser Thr Gly Pro Gin Lys 
245 250 255 

Val Asp Tyr Ala Ser Gly Glu He Ser Ala Gly Thr Ala Ala Gly Thr 
260 265 270 

Lys Pro Thr He Ala Leu Asp Thr Ala Ala Leu Gly Gly Met Tyr Ala 
275 280 285 

Asp Ser He Thr Leu He Ala Asn Glu Lys Gly Val Gly Val Lys Asn 
290 295 300 



Ala Gly Thr Leu Glu Ala Ala Lys Gin Leu He Val Thr Ser Ser Gly 
305 310 315 320 



REPLACEMENT SHEET (RULE 26) 



90 



Arg He Glu Asn Ser Gly Arg He Ala Thr Thr Ala Asp Gly Thr Glu 
325 330 335 

Ala Ser Pro Thr Tyr Leu Ser He Glu Thr Thr Glu Lys Gly Ala Ala 
34C 345 350 

Gly Thr ?he He Ser Asn Gly Gly Arg lie Glu Ser Lys Gly Leu Leu 
355 360 365 

Val He Glu Thr Gly Glu Asp He Ser Leu Arg Asn Gly Ala Val Val 
3^0 375 3SC 

Gin Asn Asn Gly Ser Arg Pro Ala Thr Thr Val Leu Asn Ala Gly H^s 
355 3 90 395 £.rr 

Asn Leu Val He Glu Ser Lys Thr Asn Val Asn Asn Ala Lys GLy Ser 
4C5 410 415 

Ala Asn Leu Ser Ala Gly Gly Arg Thr Thr He Asn Asp Ala Thr He 
420 425 43C 

Gin Ala Gly Ser Ser Val Tyr Ser Ser Thr Lys Gly Asp Thr Glu Leu 
435 440 445 

Gly Glu Asn Thr Arg He He Ala Glu Asn Val Thr Val Leu Ser Asn 
450 455 460 

Gly Ser He Gly Ser Ala Ala Val He Glu Ala Lys Asp Thr Ala His 
465 470 475 480 

He Glu Ser Gly Lys Pro Leu Ser Leu Glu Thr Ser Thr Val Ala Ser 
485 490 495 

Asn He Arg Leu Asn Asn Gly Asn He Lys Gly Gly Lys Gin Leu Ala 
500 505 51 0 



REPLACEMENT SHEET (RULE 26) 
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Leu Leu Ala Asp Asp Asn He Thr Ala Lys Thr Thr Asn Leu Asn Thr 
515 520 525 

Pro Gly Asn Leu Tyr Val Has Thr Gly Lys Asp Leu Asn Leu Asn Val 
530 535 540 

Asp Lys Asp Leu Ser Ala Ala Ser He His Leu Lys Ser Asp Asn Ala 
545 550 555 560 

Ala His He Thr Gly Thr Ser Lys Thr Leu Thr Ala Ser Lys Asp Met 
565 570 575 

31 Y Ala Gly Leu Leu Asn Val Thr Asn Thr Asn Leu Arg Thr 

530 5S5 590 

Asn Ser Gly Asn leu His He Gin Ala Ala Lys Gly Asn He Gin Leu 
355 600 605 

Arg Asn Thr Lys Leu Asn Ala Ala Lys Ala Leu Glu Thr Thr Ala Leu 
61 c SI 5 620 

Gin Gly Asn He Val Ser Asp Gly Leu His Ala Val Ser Ala Asp Gly 
625 630 635 640 

His Val Ser Leu Leu Ala Asn Gly Asn Ala Asp Phe Thr Gly His Asn 
645 650 655 

Thr Leu Thr Ala Lys Ala Asp Val Asn Ala Gly Ser Val Gly Lys Gly 
660 665 670 

Arg Leu Lys Ala Asp Asn Thr Asn He Thr Ser Ser Ser Gly Asp He 
675 680 685 

Thr Leu Val Ala Gly Asn Gly He Gin Leu Gly Asp Gly Lys Gin Arg 
690 695 700 

Asn Ser He Asn Gly Lys His He Ser He Lys Asn Asn Gly Gly Asn 
705 710 715 720 



REPLACEMENT SHEET (RULE 26) 



92 



Ala Asp Leu Lys Asn leu Asn Val His Ala Lys Ser Gly Ala Leu Asn 
725 730 735 

lie His Ser Asp Arg Ala Led Ser lie Glu Asn Thr Lys Leu Glu Ser 
740 745 750 

Thr Hrs Asn Thr His leu Asn Ala Gin Kis Glu Arg Val Thr leu Asn 
"55 760 765 

G_n Val Asp Ala Tyr Ala His Arg His Leu Ser He Thr Gly Ser Gin 
77 0 ~> ~ 5 7so 

He Trp Gin Asn Asp lys lej Pro Ser Ala Asn Lys Leu Val Ala Asn 
TS5 " 9 1 795 SCO 

Gly Val Leu A.la Leu Asn Ala Arg Tyr Ser Gin lie Ala Asr; Asn Thr 
SC5 810 615 

Thr Leu Arg Ala Gly Ala He Asn Leu Thr Ala Gly Thr Ala Leu Val 
320 825 830 

Lys Arg Gly Asn He Asn Trp Ser Thr Val Ser Thr Lys Thr Leu Glu 
S35 840 845 

Asp Asn Ala Glu Leu Lys Pro Leu Ala Gly Arg Leu Asn He Glu Ala 
850 855 860 

Gly Ser Gly Thr Leu Thr He Glu Pro Ala Asn Arg He Ser Ala His 
865 870 875 880 

Thr Asp Leu Ser He Lys Thr Gly Gly Lys Leu Leu Leu Ser Ala Lys 
855 890 895 

Gly Gly Asn Ala Gly Ala Pro Ser Ala Gin Val Ser Ser Leu Glu Ala 
900 905 910 



REPLACEMENT SHEET (RULE 26) 



93 



Lys Gly Asn lie Arg Leu Val Thr Gly Glu Thr Asp Leu Arg Gly Ser 
915 920 925 

Lys lie Thr Ala Gly Lys Asn Leu Val Val Ala Thr Thr Lys Gly Lys 
93C 935 940 



Ser Phe Ser Asn 



.yr rr.: 



960 



-7 s ---1= Ala Glu Leu A sr. Gin Lys Ser Lys Glu Leu Glu Gin Gin 
965 970 975 

-.la Gin Leu Lys Lys Ser Ser Pro Lys Ser Lys Leu lie Pro Thr 

95 2 9S5 99Q 

r_n G_u G_u Arg Asp Arg Leu Ala Phe Tyr lie Gin Ala lie Asn 
995 10C0 1C05 

^lu Val Lys Gly Lys Lys Pro Lys Gly Lys Glu Tyr Leu Gin Ala 
C'C 1 CI 5 1020 



Lys 
1 0 2 : 



Ala Gin Asn He Asp Leu He Ser Ala Gin Gly He Glu 
1030 1035 1040 



' c - - r:~ 



y Ser Asp lie Thr Ala Ser Lys Lys Leu Asn Leu His Ala 
1045 1050 1055 



Ala Gly Val Leu Pro Lys Ala Ala Asp Ser Glu Ala Ala Ala He Leu 
1 C5C 1065 1070 

He Asp Gly He Thr Asp Gin Tyr Glu He Gly Lys Pro Thr Tyr Lys 
1C75 1080 1085 

Ser His Tyr Asp Lys Ala Ala Leu Asn Lys Pro Ser Arg Leu Thr Gly 
1C90 1095 1100 



Arg Thr Gly Val Ser He Eis Ala Ala Ala Ala Leu Asp Asp Ala Arg 
11 C5 1110 1115 1120 



REPLACEMENT SHEET (RULE 26) 
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He ile He Gly Ala Ser Glu He Lys Ala Pro Ser Gly Ser He Asp 
1125 1130 1135 

He Lys Ala His Ser Asp Ile Val Leu Glu Ala Gly Gin Asn Asp Ala 
1140 1145 1150 

Tyr Thr ?he Leu Lys Thr Lys Gly Lys Ser Gly Lys Ile lie Arg Lys 
1155 1 1 6G 1165 

Lhr Lys Phe Lhr Ser Ihr Arg Asp His Leu Ile Met Pro Ala Pro Val 
' 1 ~~ ' : "5 1 1 SO 

Glu Leu Lhr Ala Asn Gly Ile Lhr Leu Gin Ala Gly Gly Asn Ile Glu 

•'S5 :i=: 1195 120C 



.r.r _nr Arg -r 

12 3 5 



.a -rc A- a G_y lys Val _hr Leu Va. 
1 21 C 1215 



Ala Gly Glu Glu Leu GI 
1220 



au Ala Glu Glu Gly rle His Lys H_£ 
1225 123C 



Glu Leu Asp Val Gin Lys Ser Arg Arg Phe lie Gly lie Lys Val Gly 
1235 1240 1245 

Lys Ser Asn Tyr Ser Lys Asn Glu Leu Asn Glu Thr Lys Leu Pro Val 
1250 1255 1260 

Arg Val Val Ala Gin Lhr Ala Ala Thr Arg Ser Gly Trp Asp Thr Val 
1265 1270 1275 1280 

Leu Glu Gly Thr Glu Phe Lys Thr Thr Leu Ala Gly Ala Asp Ile Gin 
1285 1290 1295 



Ala Gly Val Gly Glu Lys Ala Arg Val Asp Ala Lys Ile Ile Leu Lys 
1300 1305 1310 



REPLACEMENT SHEET (RULE 26) 



95 



Gly lie Val Asn Arg He Gin Ser Glu Glu Lys Leu Glu Thr Asn Ser 
1315 1320 1325 

Thr Val Trp Gin Lys Gin Ala Gly Arg Gly Ser Thr He Glu Thr Leu 
1330 1335 1340 

Lys Leu Pro Ser Fhe Glu Ser Pro Thr Pre Pro Lys Leu Ser Ala Pre 
1345 1350 1355 1360 

Gly Gly Tyr He Val Asp lie Pro Lys Gly Asr. Leu Lys Thr Glu He 
1365 1370 1375 

Glu Lys Leu Ser Lys Gin Pre Glu Tyr Ala Tyr Leu Lys Gin Leu Gin 
1380 1385 1390 

Val Ala Lys Asn He Asn Trp Asn Gin Val Gin Leu Ala Tyr Asp Arg 
1395 1400 1405 

Trp Asp Tyr Lys Gin Glu Gly Leu Thr Glu Ala Gly Ala Ala He He 
1410 1415 1420 

Ala Leu Ala Val Thr Val Val Thr Ser Gly Ala Gly Thr Gly Ala Val 
142 5 1430 1435 1440 

Leu Gly Leu Asn Gly Ala Ala Ala Ala Ala Thr Asp Ala Ala Phe Ala 
1445 1450 1455 

Ser Leu Ala Ser Gin Ala Ser Val Ser Phe He Asn Asn Lys Gly Asp 
1460 1465 1470 

Val Gly Lys Thr Leu Lys Glu Leu Gly Arg Ser Ser Thr Val Lys Asn 
1475 1480 1485 

Leu Val Val Ala Ala Ala Thr Ala Gly Val Ala Asp Lys He Gly Ala 
1490 1495 1500 

Ser Ala Leu Asn Asn Val Ser Asp Lys Gin Trp He Asn Asn Leu Thr 
1 $05 1510 1515 1520 



REPLACEMENT SHEET (RULE 26) 
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Val Asn Leu Ala Asn Ala Gly Ser Ala Ala Leu He Asn Thr Ala Val 
1525 1530 1535 

Asn Gly Gly Ser Leu lys Asp Asn Leu Glu Ala Asn He Leu Ala Ala 
1540 1545 1550 

Leu Val Asn Thr Ala H1.3 Gly Glu Ala Ala Ser Lys He Lys Gin Leu 

1 555 1 560 1 565 

-~- s ? £- r - Lyr He Val His Lys He Ala His Ala He Ala Gly Cys 

■5"C 1575 158C 



Lys Gly Lys Cys Gin Asp Gly Ala He Gly 
1595 15CC 



Ala Ala Val Gly Glu He Va2 



Ly Glu Ala Leu Thr Asn Gly .uys Asn 
1 61 C 1615 



_ys Glu Arg Glu Gin He Leu Ala i'yr Ser 
1625 1630 



Lys Leu Val Ala Gly Thr Val Ser Gly Val Val Gly Gly Asp Val Asn 
1635 1640 1645 

Ala Ala Ala Asn Ala Ala Glu Val Ala Val Lys Asn Asn Gin Leu Ser 
165G 1655 1660 

Asp Lys Glu Gly Arg Glu ?he Asp Asn Glu Met Thr Ala Cys Ala Lys 
1665 1670 1675 1680 

Gin Asn Asn Pro Gin Leu Cys Arg Lys Asn Thr Val Lys Lys Tyr Gin 
1685 1690 1695 



Asn Val Ala Asp Lys Arg Leu Ala Ala Ser He Ala He Cys Thr Asp 
1700 1705 1710 



REPLACEMENT SHEET (RULE 26) 
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He Ser Arg Ser Thr GIu Cys Arg Thr He Arg Lys Gin His Leu He 
1715 1720 1725 

Asp Ser Arg Ser Leu His Ser Ser Trp GIu Ala Gly Leu He Gly Lys 
1730 1735 1740 

Asp Asp GIu Trp Tyr Lys Leu Phe Ser Lys Ser Tyr Thr Gin Ala Asp 
1745 1-5C 1755 1760 

leu Ala Leu Gin Ser Lyr His Leu Asn Thr Ala Ala Lys Ser Trp Leu 
1765 17" c 1775 

GLr_ Ser Gly Asn Thr Lys Pro Leu Ser GIu Trp Met Ser Asp Gin Gly 
•~SC 1785 179C 



Ser Gly Val Asn Pro Arg Phe 
1 800 



1 805 



1 515 



1 820 



182: 



Phe Asp Lr.r Asn Leu Lys Arg His Leu Ala Asn Ala Asp 
1550 1835 1840 



Gly Phe Ser Gin GIu Gin Gly He Lys Gly Ala His Asn Arg Thr Asn 
1845 1850 1855 

Phe Ket Ala GIu Leu Asn Ser Arg Gly Gly Arg Val Lys Ser GIu Thr 
1S60 1865 1870 

Gin Thr Asp He GIu Gly He Thr Arg He Lys Tyr GIu He Pro Thr 
1875 1880 1885 

Leu Asp Arg Thr Gly Lys Pro Asp Gly Gly Phe Lys GIu Tie Ser Ser 
1890 1895 1900 



He Lys ihr Val Tyr Asn Pro Lys Lys Phe Ser Asp Asp Lys lie Leu 
1905 1910 1915 1920 



REPLACEMENT SHEET (RULE 26) 
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Gin Met Ala Gin Asn Ala Ala Ser Gin Gly Tyr Ser Lys Ala Ser Lys 
1925 1930 1935 

lie Ala Gin Asn Glu Arg Thr lys Ser He Ser Glu Arg Lys Asn Val 
1940 1945 1950 

He Gin Phe Ser Glu Thr ?he Asp Gly He Lys ?he Arg Ser Tyr Phe 
1355 1960 1965 

Asp Val A.sn Thr Gly Arg He Thr Asn He His Pro Glu 

1975 19S0 

[2] INrCRMAHCX FCR SEQ IT NO: 39: 




(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . .143 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Met Lys Asn Asn He Phe Leu Asn Leu Asn Lys Lys Ser He Asn Asn 
15 10 15 

Asn Kis Phe Val He Ser He Phe Phe Glu Thr He Tyr Gin Phe Glu 
20 25 30 



REPLACEMENT SHEET (RULE 26) 



m 

111 



99 



ihr Lys Asp Thr Leu Leu Glu Cys Phe Lys Asn lie Thr Thr Thr Gly 
35 40 45 

His Phe Gly Val lie Gly Ala Gin Tyr Glu Lys He Asp Ala Thr Arg 
50 55 60 

Ire He Gly Asp Tyr Glu Glu Val Asn Gly Phe Glu Tyr He Asp Lys 
65 70 75 SO 

Ala Pro Ser He Tyr Phe Ser Val Gly Asp Asp Phe Asn Pro Glu Glu 
65 90 95 

leu He He Pre He Asr. leu Ala Tyr Kis Tyr Phe Asn He Ala He 

105 110 

Ser Asp Phe leu He Ala His Pre Glu Tyr Gin Lys Lys Cys Lys Glu 
115 120 125 



----- n~ 



r Ser G_r. ir.r Asn Cys Ser leu His Glu Thr 



'3: 135 140 

o 

£ (2) INFORMATION FOR SFQ IF NO: 40: 

m 

oj 

L r* (i) SFQTZXCZ CHARACTERISTICS: 

*3 IA) LENGTH: 633 ar.rno acids 

(3) TYPE: air.ino acid 

£ (C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



Hi) MCLECTLE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY: Peptide 
(3) LOCATION: 1 . .833 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 



REPLACEMENT SHEET (RULE 26) 



100 



Val Leu Lys Thr Pro Pro Thr Leu Ala Ala Glu Leu Ser Gly Lys Thr 
15 10 15 

Gly Val Ser He Ser Ala Pro Tyr Ala Asn Glu Asn Ser Arg He Leu 
20 25 30 

leu Ser Thr Thr Asp He Ser Ser Glu Asn Gly Lys He Lys He Gin 
35 40 45 

Ser Tyr Gly Asp Glr. lyr Tyr Tyr Ala Arg Gin Ser Glu Leu Tyr Thr 
5: 55 60 

Pr.e G_u Arg Arg Ser Tyr lys Thr Gly Lys Trp Tyr Asn Arg Lys His 
55 H 75 SO 



r_U . a_ -jVS Gj.U "15 



_ys Asn Ala Lys ?: 
90 



1GC 



Gly He Asp He Lys Ser Gly Gly Ser He Asp 
105 110 



Ala _yr Ala .hr Ala The Asp Ala Pro Lys Gly Ser He Asn He Glu 
115 120 125 

Ala Gly Arg Lys Leu Thr Leu Tyr Ala Val Glu Glu Leu Asn Tyr Asp 
130 135 140 

Lys Leu Asp Ser Gin Lys Arg Arg Arg Phe Leu Gly lie Ser Tyr Ser 
145 150 155 160 

Lys Ala His Asp Thr Thr Thr Gin Val Met Lys Thr Ala Leu Pro Ser 
165 170 175 



Arg Val Val Ala Glu Ser Ala Asn Leu Gin Ser Gly Trp Asp Thr Lys 
1S0 185 190 



REPLACEMENT SHEET (RULE 26) 



101 



Leu Gin Gly Thr Gin Phe Glu Thr Thr Leu Gly Gly Ala Thr He Arg 
195 200 205 

Ala Gly Val Gly Glu Gin Ala Arg Ala Asp Ala Lys He He Leu Glu 

210 215 220 



^_y __e Lys Ser 
225 



23 



e His ihr Glu _hr Val Ser Ser Ser Lys Ser 
0 235 240 



_ys G_r. A_ a Gly Arg Gly Ser Asn He Glu Th: 
245 250 25: 



Phe _hr Gly Pro Val Ala Pro Val Leu Ser 



Ala. Pro 



265 



270 



-y o-y -yr „e va: Asp 
275 



-le Pro Lys Gly Asn Leu Lys Thr Gin lie 
2S0 285 



Pro Glu Tyr Ala Tyr Leu Lys Gin Leu Gin 
295 300 



Val Ala Lys Asn 



Le Asn Trp Asn Gin Val Gin Leu Ala Tyr Asp Lys 
310 315 320 



.rp Asp .yr Lys Gin Glu Gly Met Thr Pro Ala Ala Ala Ala Val Val 
325 330 335 

Val He Val Val Thr Val Leu Thr Tyr Gly Ala Leu Ser Ala Pro Ala 
340 345 350 

Ala Ala Gly Thr Ala Gly Ala Ala Gly Ala Gly Ala Gly Gly Ala Ala 
355 360 365 

Ala Gly Thr Ala Ala Gly Thr Gly Val Ala Ala Gly Thr Ala Ala Thr 
370 375 380 



Thr Gly Val Ala Ala Gly Thr Ser Ala Ala Ala He Thr Thr Ala Ala 
3S5 390 395 400 



REPLACEMENT SHEET (RULE 26) 
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Gly Lys Ala Ala Leu Ala Ser Leu Ala Ser Gin Ala Ala Val Ser Leu 
4C5 410 41 5 

lie Asr. Asn Lys Gly Asp lie Asn His Thr Leu Lys Glu Leu Gly Lys 
420 425 430 

Ser Ser Thr Val Arg Gin Ala Ala Thr Ala Ala Val Thr Ala Gly Val 
435 440 445 

Leu Glr. Gly lie Ser Gly Leu Asn Thr Gin Ala Ala Glu Ala Val Ser 
45C 455 460 

Ser ?r: Ala Ala Gly Lys Leu Thr Ala Asr. Leu He 
4~3 475 48C 

Ala Ala Ser Val His Thr Ala He Asr. Gly Gly Ser 
455 490 495 

Leu Gly Asp Ala Ala Leu Gly Ala He Val Ser Thr 
5 CO 505 510 

^£■1 K:_s Gly Glu Val Ala Ser Lys He Lys Phe Asn Leu Ser Glu Asp 
515 520 525 

Tyr lie Ala His Lys lie Ala His Ala Val Ala Gly Cys Ala Ser Ala 
530 535 540 

Val Ala Asn Lys Gly Lys Cys Arg Asp Gly Ala He Gly Ala Ala Val 
545 550 555 560 

Gly Glu Met Val Gly Glu Thr Leu Leu Asp Gly Arg Asp Val Gly Lys 
565 570 575 

Leu Ser Pro Gin Glu Arg Gin Lys Val He Ala Tyr Ser Gin lie He 
5S0 585 590 



REPLACEMENT SHEET (RULE 26) 
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Ala Gly Ser Ala Val Ala Leu Val Lys Gly Asp Val Asn Thr Ala Val 
595 600 605 

Asn Ala Ala Thr Val Ala Val Glu Asn Asn Ser Leu Leu Ala Arg Arg 

610 615 620 

Arg Va. Asr. lie Arg Trp Thr Pro Arg Gin Glu Leu Glu His Glu Tyr 
625 63C 635 64C 

Ala lie Leu Glu He 3 1 r. Ala He Thr Asn Gin He Arg Arg Leu As 



643 65C 



6 55 



_rc _ys -re Asn Gly He Ala He Leu Arg _hr Pre Gly Glu Pro Trp 
66C 665 670 

Thr Arg His Asp Val Gin Thr Tyr Arg Gin Tyr Tyr Asn Gin Leu Arg 

~ ' ^ CO 1 j o a D 

6rC 695 700 

Asn Gly Asn Glu Phe Asn Arg He Met Ser Ser Lys Tyr Pre Tyr Asn 
7C5 710 715 720 

Glu Leu Tyr Val Ala Asn Pro Lys Ser Ala Thr Gly Tyr Phe Arg Val 
725 730 735 

Asp Ser Tyr Asp Pro Ala Thr Arg Glu He He Ser Arg Lys Phe Thr 
740 745 750 

Gin Phe Ser Gin He Gin Glu Ser Thr Gly He Gly Tyr He Lys Glu 
755 760 765 

Ala Val Arg Lys Tyr Ser Pro Gly Thr Val He Ser Asn Val Pro Ser 
770 775 780 

Thr Pre Thr Thr He Arg Gly Arg Lys Leu Glu Gly Lys Leu He Leu 
"55 790 795 800 



REPLACEMENT SHEET (RULE 26) 



104 



Glu Val Pre Ala Gin Val Asn Pro lie Pro Gin Ser Val Leu Arg Ala 
8C5 810 815 

Ala Gin Glu Glu Asr. Val lie lie Arg Asp Thr Thr Gly Arg lie Tyr 
82C 825 830 

lys 

; 2 , INFORMATICS ICR 3Z~ IT XI : 41: 

sequence c:-l-j?-Actepistics: 
!x: sx^xzxzz tz3cpipticx: szq id no: 41 : 

Val leu lys Thr Pre Pro Thr leu Ala Ala Glu Leu Ser Gly Lys Thr 
1 5 10 15 

Gly Val Ser lie Ser Ala Pre Tyr Ala Asn Glu Asn Ser Arg He Leu 
2C 25 3G 

leu Ser Thr Thr Asp He Ser Ser Glu Asn Gly Lys He Lys He Gin 
35 40 45 

Ser Tyr Gly Asp Gin Tyr Tyr Tyr Ala Arg Gin Ser Glu Leu Tyr Thr 
50 55 60 

Phe Glu Arg Arg Ser Tyr Lys Thr Gly Lys Trp Tyr Asn Arg Lys His 
65 70 75 80 

He Thr Glu Val Lys Glu His Lys Asn Ala Lys Pro Asp Ala Val Asn 
85 90 95 



REPLACEMENT SHEET (RULE 26) 



105 



= er Ala Ser Gin Gly lie Asp He Lys Ser Gly Gly Ser He Asp 
1 CO 1 Q 5 110 

7yr Ala Thr Ala Phe Asp Ala Pro Lys Gly Ser He Asn He Glu 

115 120 125 

;iy Arg Lys Leu Thr Leu ~yr Ala Val Glu Glu Leu Asn Tyr Asp 
: 3C 135 140 

_e_: Asp Ser Sir. lys A.rg Arg Arg Phe Leu Gly He Ser Zyr Ser 

155 160 



ri- - -r r\ £i 



Met Lys Thr Ala 
170 175 



1 6: 



Ser Gly _rc Asc _hr Lys 
1 90 



_hr Gin Phe Glu Thr Thr Leu Gly Gly Ala Th: 
200 205 



_ _e Arc 



c^y .a 

21 : 



- C-_y 



~r; Ala Arg Ala Asp Ala Lys He He Leu Glu 
215 220 



225 



=r Ser ule His Thr Glu Thr Val Ser Ser Ser Lys Ser 
230 235 240 



_hr _eu _rp Gin Lys Gin Ala Gly Arg Gly Ser Asn He Glu Thr Leu 
245 250 255 

Gin Leu Pro Ser Phe Thr Gly Pro Val Ala Pro Val Leu Ser Ala Pro 
260 265 270 



;iy Gly _yr He Val Asp lie Pro Lys Gly Asn Leu Lys Thr Gin He 
275 280 285 



REPLACEMENT SHEET (RULE 26) 
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Glu Thr Leu Thr Lys Gin Pro Glu Tyr Ala Tyr Leu Lys Gin Leu Gin 
290 295 300 

Val Ala Lys Asn lie Asn Trp Asn Gin Val Gin Leu Ala Tyr Asp Lys 
305 310 315 320 



Trp Asp ^yr Lys 



32: 



y Met Thr Pro Ala Ala Ala Ala Val Val 
330 335 



Val lie Val Val _hr Val Leu ^hr iyr Gly Ala Leu Ser Ala Pre Ala 
340 345 350 

Ala Ala Gly Thr Ala Gly Ala Ala Gly Ala Gly Ala Gly Gly Ala Ala 
355 360 365 



Ala Gly Thr Ala Ala Gly Thr Gly Val Ala Ala Gly Thr Ala 
37C 375 380 



.a Thr 



Thr Gly Val Ala Ala Gly Thr Ser Ala Ala Ala lie Thr Thr 
385 39C 395 



400 



Gly Lys Ala Ala Leu Ala Ser Leu Ala Ser Gin Ala Ala Val Ser Leu 
405 410 415 

lie Asn Asn Lys Gly Asp lie Asn His Thr Leu Lys Glu Leu Gly Lys 
420 425 430 

Ser Ser Thr Val Arg Gin Ala Ala Thr Ala Ala Val Thr Ala Gly Val 
435 440 445 

Leu Gin Gly lie Ser Gly Leu Asn Thr Gin Ala Ala Glu Ala Val Ser 
450 455 460 

Lys His Phe His Ser Pro Ala Ala Gly Lys Leu Thr Ala Asn Leu lie 
465 470 475 480 



Asn Ser Thr Ala Ala Ala Ser Val His Thr Ala lie Asn Gly Gly Ser 
485 490 495 



REPLACEMENT SHEET (RULE 26) 



1 07 



Leu Lys Asp Asn Leu Gly Asp Ala Ala Leu Gly Ala lie Val Ser Thr 
500 505 510 

Val Kis Gly Glu Val Ala Ser Lys lie Lys Phe Asn Leu Ser Glu Asp 
515 520 525 

Tyr lie Ala Hrs Lys He Ala His Ala Val Ala Gly Cys Ala Ser Ala 
530 535 540 

Val Ala Asn Lys Gly Lys Cys Arg Asp Gly Ala He Gly Ala Ala Val 
545 550 555 560 

Gly Glu Met Val Gly Glu Thr Leu Leu Asp Gly Arg Asp Val Gly Lys 
555 570 5"5 

Leu Ser Pre GLn GLu Arg Gin Lys Val He Ala Tyr Ser Gin He He 
53C 535 590 

Ala Gly Ser Ala Val Ala Leu Val Lys Gly Asp Val Asn Thr Ala Val 
3 " 3 6 C 0 605 

Asn Ala Ala Thr Val Ala Val Glu Asn Asn Ser Leu Leu Ala Arg Arg 
610 615 620 

Arg Val Asn He Arg Trp Thr Pro Arg Gin Glu Leu Glu His Glu Tyr 
62 5 630 635 640 

Ala lie Leu Glu He Gin Ala He Thr Asn Gin He Arg Arg Leu Asp 
645 650 655 

Pro Lys Phe Asn Gly He Ala He Leu Arg Thr Pro Gly Glu Pro Trp 
660 665 670 

Thr Arg His Asp Val Gin Thr Tyr Arg Gin Tyr Tyr Asn Gin Leu Arg 
675 680 685 



REPLACEMENT SHEET (RULE 26) 
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Glu Ser Arg Gly 
690 

Asn Gly Asn Glu 
705 

Glu Leu Tyr Val 

Asp Ser Tyr Asp 
74C 

Glr. ?he Ser Gin 
755 

Ala Val Arg lys 
770 

Thr Pro Thr Thr 
7 3 5 

Glu Val Pro Ala 

Ala Gin Glu Glu 
820 

lys 



Phe Ala Val Glu 
695 

Phe Asn Arg lie 
71 C 

Ala Asn Pro Lys 
725 

Pre Ala Thr Arg 




7 or 



Gin Val Asn Pro 
805 

Asn Val lie lie 



Pro lie Tyr Arg 
700 

Met Ser Ser Lys 
715 

Ser Ala Thr Gly 
730 

Glu lie lie Ser 
745 

"hr Gly He Gly 

7hr Val He Ser 
780 

lys Leu Glu Gly 
795 

He Pro Gin Ser 
81 0 

Arg Asp Thr Thr 
825 



He Arg He Asn 



Tyr Pro Tyr Asn 
720 

Tyr Phe Arg Val 
735 

Arg Lys Phe Thr 
750 

Tyr He Lys Glu 
765 

Asn Val Pro Ser 



Lys Leu He Leu 
800 

Val Leu Arg Ala 
815 

Gly Arg lie Tyr 
830 



{2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 ammo acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



REPLACEMENT SHEET (RULE 26) 



1 09 

(ii) MOLECULE TYPE: peptide 



[ix) FEATURE: 

(A) NAME/ KEY: Peptide 
(5) LOCATION: 1 . . 162 



Met Lys Lys Asp ue ?hs Tyr Cys Glu Gin Trp Ser Tyr Gly Tyr Lys 
1 5 10 15 

Arg Leu His Lys Pro Phe Ser Glu Lys Gin Ala Glu Glu Lys His Leu 
2 C 2 5 3 C 

-ys ^-Y Glu Leu Tyr Thr Ala Vai lie Gly Ser Ala Thr Gin Pro Glu 
35 40 45 

Tyr Val He Thr Leu Arg Glu Glu Val Gly Phe Phe Ser Val Asn Phe 
53 55 60 

Phe Asp Lys Phe Gly Arg Asp Tyr Leu Thr His Gin Phe Gin Lys Tyr 
65 70 75 80 

Ser Asn Ser Asn Tyr Tyr Phe Leu Ser Met Ala Val Trp Arg Asp Tyr 
85 90 95 

He Thr Leu Glu Ser His Asp Leu Ala Glu Gly Tyr Thr Tyr Phe Phe 
100 105 110 

Asn Glu Asn Thr Asp Asp Cys Tyr Val Leu Lys Gin Asp Phe He Asn 
115 120 125 

Asn Glu Arg Tyr Glu Lys Thr Glu Leu Tyr Ser Gin Lys Asp Lys Val 
130 135 140 



REPLACEMENT SHEET (RULE 26) 



110 



He Leu ?he Pro Lys Phe Gly Glu Tyr Asp Leu Val Leu Asn Pro Asp 
145 150 155 160 

He He 



2; IXr CRVATICX ECR SEQ ID NO: 43: 

(_; SEQUZXCZ CHARACTERISTICS: 

(A} LENGTH: 90 ammo acids 
(3; TYPE: anno acid 
iC) STRAXTETXESS: single 
(C) TCPCLOGY: linear 

.n, MTLECY1E TYPE: ceotide 

(ix) FEATURE: 

[ A ' XAME / KEY : Pec 1 1 de 
(3; LOCATION: 1 . .90 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Met Asn Lys Arg Met Lys Met Cys Pro Ala Cys Gin Gin Gly Tyr Leu 
15 10 15 

Tyr His Ser Lys Pro Lys Tyr Leu His Asp Glu He He Leu Cys Asp 
20 25 30 

Glu Cys Asp Ala Val Trp Leu Lys Gly Met Asn He Phe Tyr Gly Glu 
35 40 45 

Tyr Glu Lys Asp Phe Tyr Ser Tyr Val Pro Phe Met Glu Ser Gin Gly 
50 55 60 

lie Thr Ser Glu Cys He Trp Glu Gly Asp Leu Phe Asp His Pro Tyr 
65 70 75 80 
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Tyr Glu Asp Glu Asn Ser Asn Asp Met Asp 
85 90 

INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARAC "ERISTICS : 

(A; LENGTH : 313 ar.ino acids 

(3) TYPE: a-mc acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: pepride 

(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(E) LOCATION: 1..313 

(xi) SEQUENCE CESCRIPTION: SEQ ID NO: 44: 

Met Ser Ala Thr Glu lie Glu Lys Ala Lys Ala Lys He Thr Ala Tyr 
15 10 15 

Ser Lys Leu Val Ala Gly Thr Ala Ser Ala Val Val Gly Gly Asp Val 
20 25 30 

Asn Thr Ala Ala Asn Ala Ala Gin He Ala Val Glu Asn Asn Thr Leu 
35 40 45 

Tyr Pro Arg Cys Val Gly Ala Lys Cys Asp Glu Phe Gin Lys Glu Gin 
50 55 60 

Gin Lys Trp He Arg Glu Asn Pro Glu Glu Tyr Arg Glu Val Leu Leu 
65 70 75 80 



REPLACEMENT SHEET (RULE 26) 



112 



Fhe Gin Thr Gly Phe lie Pro lie lie Gly Asp lie Gin Ser Phe Val 
85 90 95 

Gin Ala Gin Thr Ala Ala Asp His Leu Phe Ala Leu Leu Gly Val Val 
100 105 110 

Pre Gly lie Gly Glu Ser lie Gin Ala Tyr Lys Val Ala Lys Ala Ala 
115 120 125 

lys Asn Leu Gin Gly Met Lys Lys Ala Leu Asp Lys Ala Ala Thr Val 
13C 135 140 



145 



Gly Tyr Val Ser Lys Thr Lys lie Lys lie Gly Gin 
15C 155 160 



Thr Glu Leu Arc Val Thr 



Ala Ala Thr Asp Lys Gin Leu -eu Lys Ala 
170 175 



He Gly Glu Gly Arg Asp Thr Thr Gly Lys Met Thr Glu Gin Leu Phe 
18C 1S5 190 

Asp Ser Leu Ala Lys Gin Asn Gly Phe Arg Val Leu Ser Gly Gly Lys 
195 2G0 205 

Tyr Gly Gly Asn Asn Gly Phe Asp His Val Trp Gin Ala Ala Asp Gly 
21 0 21 5 220 

Ser Val Val Leu He Val Glu Ser Lys Gin He Arg Asn Gly Thr Val 
225 230 235 240 

Gin Leu Asn Pro Asn Gly Ala Gly Gly Tyr Thr Gin Met Ser Glu Asp 
245 250 255 

Trp He Arg Gin Val Leu Asp Gin Leu Pro Asp Gly Ser Pro Ala Lys 
260 265 270 



Ala Ala Val Phe Lys Ala Asn Lys Asn Gly Thr Leu Lys Thr Ala He 
275 280 285 
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Ala Gly Val Asp Arg Gin Thr Gly Lys Ala Val lie Leu Pro Val Lys 
290 295 300 

Val Pro Ser Lys Thr Asn lie Arg Arg 
3C5 310 

INFORMATION FOR SEQ ID NO: 45: 

(-) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 a_~ino acids 
(3) TYPE: arr.ino acid 
CO STRANDEDNESS: single 
CD) TOPOLOGY: linear 

'n; MOLECULE TY 3 E : cernde 



(ix) FEATURE: 

(A) NAME / KEY : Peptide 
(3) LOCATION: 1..311 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Met Gly His Asn Met Met Thr Thr Gin Lys Trp Tyr Glu His lie Thr 
15 10 15 

Asn Val lie lie Gly Asn Thr Ala Asn Phe Asn Ser Gly Cys Leu Asp 
20 25 30 

Ser lie Asp Tyr Val Asp Glu Arg Lys Gly Val Pro Leu Ala Ala Met 
35 40 45 

Gin His lie Phe Met Asp Val Arg Ala Ala Ala Ser His Ala Tyr Leu 
50 55 60 



REPLACEMENT SHEET (RULE 26) 



114 



?he Glu Kis Asp Leu Lys Lys Phe Lys Gin Tyr Ala Tyr Val Ala Gly 
65 70 75 80 

Lys Leu Gly Val Leu Leu Ser Val Asn Ser Thr Asp Pro Glu Pro Phe 
85 90 95 

-ys Asp 

ICC 105 110 

Leu Met Ser Asp Ser Pro Gin Leu Arg Glu Phe Leu Val Arg Asn lie 
115 120 125 

Asp Asn lie Ala Asn Asp Thr Glu Ala Phe lie Asn Arg Tyr Asp Leu 
130 135 140 

Asn Arg His Y.ez lie Tyr Asn Thr Leu Leu Met Val Glu Gly Lys Gin 
1^= 150 155 160 

Leu Asp Arg Leu Lys 31n Arg Ser Glu Lys Val le: Ala His Pro Thr 
165 170 175 

Pro Ser Lys Trp Leu Gin Lys Arg Leu Tyr Asp Tyr Arg Phe Phe Leu 
180 185 190 

Ala Phe Ala Glu Gin Asp Ala Glu Ala Met Lys Ala Ala Leu Glu Pro 
195 200 205 

Leu Phe Asp Lys Lys Thr Ala Arg Met Ala Ala Lys Glu Thr Leu Ser 
210 215 220 

Tyr Phe Asp Phe Tyr Leu Gin Pro Gin lie Val Thr Tyr Ala Lys lie 
225 230 235 240 

Ala Ser Met His Gly Phe Asp Leu Gly lie Asp Gin Glu lie Ser Pro 
245 250 255 

Arg Asp Leu lie Val Tyr Asp Pro Leu Pro Ala Asp Glu Tyr Gin Asp 
260 265 270 
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lie Phe Asp Phe Met Lys Gin Tyr Asp Leu Ser Tyr Pro Tyr Glu Tyr 
275 280 285 

Leu Gin Asp Trp lie Asp Tyr Tyr Thr Phe Lys Thr Asp Lys Leu Val 
290 295 300 



Phe Gly Asn Ala Lys Arg Glu 
305 310 



[2] INFORMATION - FOR SEQ IT NO: 46: 
(-5 SEQUENCE CHARACTERISE I OS: 

f f ^ r — \'- -_- . i ■. . ^ _ _ 

N^r . Z i *~a.z> — pa_Io 

(3) TYPE: nucleotide 

(C) S TRANEE DNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: ENA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
GCCACCGGTA CGGAAACTGA A 21 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(ill) HYPOTHETICAL: NO 



(iv) ANT I SENSE : NO 



!x:: SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

C C _ Gl_-. — CA _G_C_A.LiCC A G AAGA 

;2; INFCRM-TICN FOR SEQ ID NO: 48: 

v_) SEQUENCE CHARACTERISTICS: 
.A, LENGTH: 31 base pairs 
:S; TYPE: nucleotide 
(C; STRANDEDNESS: single 
■D) TOPOLOGY: linear 

;n; MOLE CUE TYPE: DNA (genome) 

;ili; HYPOTHETICAL: NO 

(iv; ANT I SENSE : NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

CCGAGATCTT TAACCCTTTG GGCTTAAGCG A 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs 
(3) TYPE: nucleotide 
(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv, ANTI SENSE : NO 



(xi) BZQZZ^CZ DESCRIPTION: SEQ ID NO: 49: 

GGGAGATCTC CCGCCCGT GC TGTGCATTA 29 

(2) INFORMATION TOR SEQ IT NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 
' A ; LENGTH: 2 5 case pairs 
(3) TYPE: nucleotide 

(C) ST HANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
AAGAGATCTG CAGCCAAGGC TCTCGAAA 28 
(2) INFORMATION FOR SEQ ID NO: 51 : 
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(l) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 26 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(11) MOLECULE TYPE: OX A (ger.oir.ic) 

I:::) HYPOTHETICAL : XI 

(iv) ANT I SENSE: NO 



■xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

GGGAGri.^ CTC AGGClGCCGC CGTTGA 

(2) I N E GRMAT I ON EOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 28 base pairs 
(3) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
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GGGAGATCTC ACCCCAAGAA CGCCAAAA 

(2) INFORMATION FOR SFQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 
(A; LENGTH: 31 base paxrs 
[3] TYPE: nucleotide 
:c; STRANDEDNESS: single 
;C, TOPOLOGY: linear 

(n) MOLECULE TYPE: DNA (genomic) 

[-v. ANTI3EN3E: NC 

SEQUENCE TESCRIPTION: SEQ ID NO: 53: 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 
( A; LENGTH : 12 base pairs 
[3) TYPE: nucleotide 
(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(in) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
AGTGGCTCCT AG 
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(2) INFORMATION FOR SEQ ID NO: 55: 

(l) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
(3) TYPE: nucleotide 
(C) STRANDEDNESS: single 
(T, "PC LOGY : iir.ear 

(n) MOLECULE TYPE: UNA (genomic) 

(ill} HYP C THE T I CAT : NO 

(iv) AXTI SENSE: NO 

Ix:; SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

/ioCA^ i C _ C v_ A. C _ ^. _ C AC C CAG 

(2) INFORMATION TOR SEQ ZZ NC: 56: 

(-) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 base pairs 
(3) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
AGTGGCTCTT AA 

(2) INFORMATION FOR SEQ ID NO: 57: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 10 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii ; MOLECULE TYPE: DNA (genomic) 

(in) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 



[2) INFORMATION TOR SEQ ID NO: 58: 

(-) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 24 base pairs 
(b; TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

AGCACTCTCC AGCCTCTCAC CGAC 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 base pairs 
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(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(in) HYPOTHETICAL: NO 
(iv) ANT I SENSE : NO 

(x:) SZQ'ZZ^ZZ DESCRIPTION: SEQ ID NO: 59: 
GTACTTGCCT A3 

(2) INFORMATION FOR SEQ ID NO: 6C: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
(3) TYPE: nucleotide 
(C) STRANDEDNESS: single 
\Z j _ V PC__CGY: _mes.r 

(ii) MOLECULE TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: NO 

(iv) ANT I SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
ACCGACGTCG AC TAT C CAT G AACG 
(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I SENS ±: : NO 




1 2 



(2) -NrOHVAriON FOR SEQ ID NO: 62: 

SEQUENCE CHARACTERISTICS: 
(A; LENGTH: 10 base pairs 
(B; TYPE : nucleotide 
(C) STRANDEDNESS: single 
(T;. TOPOLOGY: linear 

(n) MCIECT1E TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(:v) ANT I SENSE : NO 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 62: 
GTACTTGGGC 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
(3) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

ACCGACGTCG ACTATCCATG AACC 24 

(2) INFORMATION ?CR SEQ ID NO: 64 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 base pairs 
(3) TYRE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYRE: DNA (genonic) 
(ill) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
AATTCTCCCT CG 

(2) INFORMATION FOR SEQ ID NO: 65 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(ill) HYPOTHETICAL: NO 
(iv) ANT I SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
AGGCAACTGT GCTATCCGAG GGAG 

(2) INFORMATION FOR SEQ ID NO: 66: 

NCE CHARACTERISTICS: 
LENGTH: 1 40 base pairs 
TYPE: nucleotide 
STRANDEDNESS : s ingle 
TOPOLOGY: linear 

(ill) HYPOTHETICAL: NO 
(iv) ANT I SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
GATCAACTTT TCCCTGTTTG TCCCATTACC GGTTTGAATG AACCGATTGC GCGCCGCGCG 60 
TGTTGTTGGA CATTACCTGC GATTCAGACG GTACGATTGA CCACTACATC GAGGAGAACG 1 20 

GCAAT CAGGG TACAATGCTA 140 
(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(l) SEQU: 
(A) 
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(ill) HYPOTHETICAL: NO 
(iv) ANT I SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

GATCCGCG7A CTTGGTTTTT CA7AT 7 IT GC ATAGTCTTGT CGGTCGGGCA TCTTCCCCGA 60 

CATCATCTAA ATTIGTCTTT ATTGGTTTTT ACGCCACTCA TTGCGGATAA ACAATATTCC 120 

GCCTTGC7GT GGCGAATGTT GAAGCTAGCC TGCATCACCG TAATCAGGTT GCCCGTTACC 1 SC 

GAGCGTTCGA GA 192 

:2; INFORMATION FC? SEQ IT. NO: 68: 

U) SEQUENCE CHARACTERISTICS: 

(3) TYPE: nucleotide 

(C; STRA-NCEGNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
GATCCGGCTG CCCGACGCGC GCAAAATTGC CGCCGAGGAA AGCGCGCACA ACCACGACGG 60 
CAAAACCAGC GTATGGCAAT ACAAACATCT CGTGTTCGGT ACGGCAGGCA TTTTCTGCTA 120 
TGTCGGCGCG GAGGTGTCTA TCGGTTCGTT GATGGTCAAC GTATTGGGTT ATCTGAAAGG 180 
GCTGGATC 188 
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(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 304 base pairs 

(B) TYPE: nucleotide 

(C) STRAXDEONESS : single 

(D) TOPOLOGY: linear 

(n) MOLECULE TYPE: OXA (gencmic) 
(ill) HYPOTHETICAL: NO 
(iv) ANT I SENSE : NO 

(xi ) SEQUENCE DESCRIPTION: SEQ 10 NO: 69: 
GATCCCCCAC TTTACCTCGG GCAGATTTTG CGCGTTCATT ACAATAGCGT ATTTATGCGT 6 0 

TTGCGTTTGC GCTTGCCGCT GCGGGGCCCC CGCCGGTATG GGAAAACATC AAT AT GGCGG 1 20 

TATAAAGCGC GGT AT GGCGG AAAACCT GCC GTTTCCAAGT TTTATTCATC TTTTATTCCT 1 60 

TGAGTTTGCC TTCAC GGGAC GGGGCGGGGC GCGGAACGCG GGGTTCGGTA AACCGCCCGA 24 0 

TTCCGCGCCC GCCGAATTGC TGATTGAAAA GCTTACTTCC CCATTTTAAC TTTGCACACT 300 
GATC 304 
(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 3 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 



(iv) ANTI SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 



GATCAGACCC ATTTTCAGCG CACCGTAAGC GCGGATTTTC TCGAATTTTT CCAAAGCTGC 



60 



GGCATCGTTG TTGAI 



GC AA-""~' 



GCCCGIG TAGCCCAAGT CGGCGGCATT 



1 2( 



:aggaaaacg gtcggaatc-c ccgc 



GTGGCT TTCAAACGGC CTATAT1 



1 SC 



:gggaacata CTGCCTTCGC cgtcggctgg 



243 



(2) I NF CRMAT I CN FOR SEQ ID NO: "1 

U) SEQUENCE CHARACTERISTICS: 

(A) Z-zSGZriz 236 base pairs 
(3) TYRE: nucleotide 
(C) STRANDEDNESS: single 
<D) TCRCTOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71 



CGGCGGCGTAGTccgccGcgACAGCGTTACCATAAGCGGGACAGACTACACCCCTTTATCTAACCCGC 
AAAGTTTGGATACGGAATTAAAATGGTTGCTTCAAGAAGCTCCCGAAATAGAAAATCCTTTCGACCGC 
GCCGTTTATCTCCATAATAATTTGGCGTATCTTCAATATTTTAAAGATTGCAATAAACGTACTGCCAG 
AAACTGCATGACCTTGTCGCTGATGCGCTCCG 
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(2) INFORMATION FOR SEQ ID NO: 72: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 280 base pairs 

(B) TYPE: nucleotide 

(C) STRANGENESS: single 

(D) TOPOLOGY: linear 

(n) MCLECLLE TYPE: DNA (genomic) 
(ill) HYPOTHETICAL: NC 
(iv) ANTI SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
CGGTCAATCA CAAGAAAGTC AGCCGTCTGA TGGCGAAGAC GGGGCTGAAG GCAGTGATAT 60 
GGZGGCGC:^ ATACCGCTCG TTCAAAGGAG AAGTCGGCAA AATTGCGCCG AATATCCTGC 120 
GACGCTGTTT C CAT G C A GAA AAGCCGAATG AGAAATGGGT AACGGACGTT GCCGAGTTCA ISC 
ATGTAGGCGG AGAAAAGATA TACCTTTCTC CGATTATGGA TTTGTTTAAC GGGGAAATCG 24 0 

TCAGTTACCG TATTCAGACC CGCCCGACTT TCGATTTGGC 280 
(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 base pairs 

(B) TYPE : nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 
(ili) HYPOTHETICAL: NO 
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(xv) ANTI SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
CGGTCAGAAA CAGGCAAGGT AATGAAAATG CCTGAGGCAC GGACTGTGCT GCGAACGAAA 60 

ACTCCTTACC GAAGTCTICT ATACCCAGGC TCAATAGCCG CTCAAGGAGA GAGCTATCAT 1 20 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 base pairs 

(3) TYPE : nucleotide 

[Cj STRA-XDEDNESS : single 




(ivj ANTI SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

CGGTCAGAAA CAGGCAAGGT AATGAAAATG CCTGAGGCAC GGACTGTGCT GCGAACGAAA 60 

ACTCCTTACC GAAGTCTTCT ATACCCAGGC TCAATAGCCG CTCAAGGAGA GAGCTATCAT 120 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 152 base pairs 
(3) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(ill) HYPOTHETICAL: NO 
(iv) ANT I SENSE: NO 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 75: 
CGGTGTITTT C T I AA C AAT 7 CGCCGAC77C A7GGCGA7AT T7AAGTGACA G77GC7CCGC 60 
CCACGCAG77 GCGCCGA.AC7 CAGCACCACG ACA77A7AC7 GAT 7 AT GC AC ATCGGCAAGA 120 
TCAAAC7GAC C7A7CG7AG7 ATCGCAGAC7 G7 152 
(2) INFORMATION FOR SEQ ID NO: 76 




I::; vzzzz"^z TYPE : DNA (ger.orr.ic) 

(ix^) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

CGGGAGGTTTTGTGCATCCTGATACCGATCGGTTGTTGTTGCTCAAAGGACAGAAGGCCGCTGATAAA 
CGAGATTACCTGTTTGTCGCTATTGACGATTTTTATACTCTGCCATTTTGCCAGACAAAACCGCAGAC 
AGTGCTGCCAAGTTTCTGACCGAACATCTGGCCGACCCCTGCTTGTACCTGATTGAGTACGCTTACTC 
TGACAATGATAGGTAATATAAAGAGCCGTCCAACATGCTTTCGGTGCAGTTTGTTATGATAATGGGAT 
TGGTTGGAGGCTTGCCCGATTTGCTTGTCCGCAGACCAACGGTAAGGCGGAGCGGGTTATCCGTACCT 
TGATGGAGATGTGGCATGAGGAACAGTCGTTTGACAGACCG 

(2) INFORMATION FOR SEQ ID NO: 77 
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(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 base pairs 
(3) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(-i) MOLECULE TYPE: DNA (genomic) 
;:::) HYPOTHETICAL: NO 
(iv) ANT I SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
Z 33 A G C AT AA AATCGTTATT AAAGAT AAT G GTATAGGAAC GAGCTTCGAT GAAATCAATG SZ 
ATTTTTATTT GAGAATCGGT C G G AA C A G AA GGGAAGAAAA ACAAGCCTCC CCGTGCGGAA 1 20 

GAATTCCAAC GGGT AAAAAA GGCCTTGGTA AATTGGCATT ATTCGGGCTT GGCAACAAAA ISC 
TTGAAATTTC T ACT AT CC AG G G AAA C G AAA GGGTTACTTT TACTTTGGAT TATGCAGAGA 24 C 

TTCGAAGAAG CAA GGGT ATT TATCAACCG 269 
(2) INFORMATION FOR SEQ ID NO: 78 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 203 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(IV) ANT I SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

CGGATGAAAACGGCATACGCgcCAAAGTATTTACGAACATCAaAGGCTTGAAGATACCGCACACCTAC 
ATAGAAACGGACGCGAAAAAGCTGCCGAAATCGACAGATGAGCAGCTTTCGGCGCATGATATGTACGA 
ATGGATAAAGAAGCCCGAAAATATCGGGTCTATTGTCATTGTAGATGAAGCTCAAGACGTATGGCCG 

(23 INFORMATION FOR SEQ ID NO : 79: 

{-) SEQUENCE CHARACTERISTICS: 

(A; LENGTH: 229 base pairs 
(S; TYRE: nucleotide 

/ — ' O TD '"" v — ^ ^ V • * - *~ ~ — v 
V,!--, lOlTv'j-^sjI . — CiZT 

(in) HYPOTHETICAL : NO 
;iv; ANTI SENSE: NO 

(xi ) SEQUENCE EE SCRIPT I CN : SEQ ID NO: 7 9 

CGGTTTCAGG TTGTCGCGAA GGCTCGGTAA CGGGCAACCT GATTACGGGT GATGCAGGCA 60 

GOTTGAACAT TCGCGACGGC AAGGCGGAAT AIGTTTATCC GCAATGAGTG GCGTAAAAAC 120 

CAATAAAGAC AAATTTAGAT GATGTCGGGG AAGATGCCCG ACCGACAAGA CTATGCAAAA 180 

TATGAAAAAC CAAGTACGCG GATCAGGCAT GGAT GCACGA TCCAATCCG 229 

(2) INFORMATION FOR SEQ ID NO: SO: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 base pairs 
(3) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE; DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I SENSE : NO 

(x.) SEQUENCE ZESCRZPZZCK : SEQ ID NO: 80: 
CGGGTCGCTT TATTTTG7GC AGGCAI7ATI "ICATTTT7 GGCTTGACAG TTTGGAAAIA 60 
TEGIGTAICG GGGGGGGGZA ET7GCEGACG TAAAAAACTA TAAACGCCGC GCAAAATATG 120 
GCTGACTATA TTA7TGAC7T TGATTTTGTC CTGCGCGGTG ATGGATAAAA TCGCCAGCGA 180 
TAAAGAATET GCGAGAACCE GkZGCCG 207 
(2) INFORMATION 7 FOR SEQ ID NC: 81 : 

(A) LENGTH: 2 24 base pairs 

(3) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
CGGCAACGAT TTGAGCTATC GCGGTTACGA CATTCTGGAT TTGGCACAAA AATGCGAGTT 60 
TGAAGAAGTC GCCCACCTGC TGATTCACGG CCATCTGCCC AACAAATTCG AGCTGGCCGC 120 
T T AT AAAACC AAGCTCAAAT CCATGCGCGG CCTGCCTATC CGTGTGATTA AAGTTTTGGA 180 
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AAGCCTGCCT GCACATACCC ATCCGATGGA CGTAATGCGT ACCG 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 212 base pairs 
(3) TYPE: nucleotide 
[C] STRANDEDNESS : single 
( T , TCPCLCGY: linear 

In.) MC1ECC1E TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: NO 

;iv; ANT I SENSE : NT 

:xi) SZZ'S^CZ TESCRITTIIX: SEQ ID NO: 32: 

CGGGAACAGC CATTGCCCAC GZCCACGCCC CCCAAGAAAG ACGGAAACTA CTGCCTAAAT 

TTTCC-GCAAT CAAGTT3AC3 ATTAAAGGGC TGGGGGCAGT TGCAGTAATA AAC AT AGC C G 

ACGAAATGGG ATTGGAATGA TAGTTGACCA AAGCCAAATA TTTACCCATC TTGCCTTCTG 

TGCCTTTTGC GGGATTGGAG CCGTAACTGC CG 

(2) INFORMATION FOR SEQ ID NO: 83 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 353 base pairs 
{3) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



REPLACEMENT SHEET (RULE 26) 



136 



(iv) ANT I SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: S3 



CGGGAATTCT GAGCAGAA7G AAA G AAAGC A GGCTTGATAA TTTCATAAAG TTATTGGAAG 



AAAAAGGA77 7 AC 



iT 77CGG7A77C ACAATACGGC 7GAT7ACGGA AT7CCCCAAA 



GCCG7AAAAG A777ACG77A A77GCAAACA GAA7AACCAA AGAAAAGC7G GAACCAG7CA 



AGTA77CGGG CAAACC-GC77 ACGG7AGCCG ATGTTTTGGG AATGGAAA7G 



:caa 



CATTA77GCA GGACACCAAG ACGAAACGGA TTTTATGCAT AGCTGTGCGG GAA77A7CTG 



sz-f. ^ fL w ^ 



:7GA7ACC7A AAAACGGAGG 



(2) IXECRVA7ICX ecr 



NO: S4: 



{?.) LENG7H : 3C8 base pairs 

;3) TYPE: nucleotide 

(C) S7RAXDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE 7Y?E: ON A (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 



AATTCCGTAT CCAAACTTTG CGGGTTAGAT AAAGGGGTGT AGTCTGTCCC GCTTATGGTA 



ACGCTGTCGC GGCGGACTAC GCCCGGAGCC TTTTTCCAGT AAGTTTTCGG AAATCAGGCT 



GTGGGTGGTT TTTAAGAAAT CCAACCAGTC AAACGGCTCG GGGCTGTCCA AACCGGACAC 
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AGGTGCCGGT AACTTTCCCT CAGGTTGATT AACATTACGG CATCCGAATA TAACTTCCCG 240 

CCTGCGGTTT GCCCGAGTTT AAGCAATGCC TGCGTATCGT ATTGATTATA AAGTGTTTCC 300 

TTCCAATT 308 

(2) INFORMATION FOR SEQ ID NO: 55: 

[Lj SEQwEXCE C~ A? ^ C~ r R~ ~ — 0^ * 

(Ay ~ i 1 C4 cdss pairs 

[3) TYPE: nucleotide 

(D) TOPOLOGY : linear 



(:v) ANT I SENSE : NO 

(x:) SEQUENCE OESCRIPOIOX: SEQ ID NO: 85: 

AATTCGTGTG CCGOGTCGAO AAACCGCTGA CGTAGCGGAT GTCTCATGCC ACGTTTCAAA 60 

GCAGGTTGAT GGCGGT7AGC AAOCCTCTGA TTTCACTGGG ATAT 104 

(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 89 base pairs 
(3) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
(in) HYPOTHETICAL: NO 
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(iv) ANTISENSE: NO 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
AAT73CGTAG AGTGGGCTTC AGCCACGTTT TTTCTTTTTC GGTCGTTGAT TGGTGGGCTG 60 

AACCAcrror ttcggaaaec cgtaicaig 89 

(2) INT CRM-. 7 1 OX FOR SEQ 10 NO: 87: 

Ii; SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 27 3 base pairs 

(3) TYPE: nucleotide 

(C) S7RANDEDNESS : single 



MOLECULE TYPE: DNA (genomic) 
' l 1 1 ^ " V ~P OT"E 0 1 C A 0 : \0 
[:v) AX" SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

AATTTCCACC TATGCCCTAC GCAGCGATTA TCCGTGGTTT ACCCAAAGGG TGATTATGGC 60 

AAAAGCGCGG GGTTGAGCGA CCGCCTTTTG TTGCCGGCGT TCAAACGGGT TTTGATAGGA 1 20 

AAIGCAGGCA CGAAGCCTCG GCTGATTGTG ATGCACCTGA TGGGTTCGCA CAGTGATTTT 1 80 

TGCACACGTT TGGATAAGGA TGCGCGGCGG TTTCAGTATC AAACTGAAAA AATAT CCTGC 240 

TATGTTTCCA TCAATCGCGC AAACCGATAA ATT 27 3 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 0 base pairs 
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(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(in) HYPOTHETICAL: NO 
(iv) ANT I SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
AATTCTTCCG CACGGGGAGG CTTGTTTTTC TTCCCTTCTG TTCCGACCGA TTCT 

i-u-*Aj-i _ ^ n — -jr A C- - C o-- .-.o — CO _ -.T AC CA A _ CTT T AAT AA CGA _ 

CTCCGGTTTA TCGAATAACC TAACTTCCAC TTCCGTAGCA CATGCATCGT AGGC = 
TATCAACTCG GC AAT C GC AG GAACA3TGTG CGAATACAAT CTTTACACCC AAAT 
TTACGGTTGG CTCGAAACTC AATTTCAAXT 
(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 base pairs 

(B) TYPE: nucleotide 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(n) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
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AATTATGAAC ACACGCATCA TCGTTTCGGC TGCGTTCGTT GCGTTGGCAT TAGCAGGTTG 60 

CGGCT CAATC AATAATGTAA CCGTTTCCGA CCAGAAACTT CAGGAACGTG CCGCGTTTGC 1 20 

CTTGGGCGTC ACCAATGCCG TAAAAATCAG CAACCGCAGC AATGAAGGCA TACGCATCAA 180 

CTTIACCGCA ACIGIGGGIA AGCGCGTGAC CAATGCTATG TTACCAGTGT AATCAGCACA 240 

ATCGGCGTTA CCACTrCCGA TGCAATT 267 

[2) INFORMATION FOR SEQ ID NO: 90: 

;_; SIQ'JEXCE CHARACTERISTICS: 

(A; LENGTH: 2 34 base pairs 

;C; 3TRAJOEDNESS: single 
' E, TOPOLOGY: linear 

( 1 1 ; MOLE CUE TYPE : DNA ( genomic ) 

(ill) HYPOTHETICAL: NO 

(iv) ANT I SENSE: NO 

(xi) SZQ:j^CE DESCRIPTION: SEQ ID NO: 90: 

AATTTTTATT TGGTTCGTAG TCATTTTGTG CAACTGAACG ATATTCGTTT TCATCATTGC 60 

TAACGTCTAG TGCCCATTGT GGCCCGTAAT AAGAGATTTC GTCTCCTTTT ACATGTTTGA 120 

CGCTGACGGC ATACTGGGGA TCGATGACGG ATAATGTACG TCTGTTGACA TCTGCAACGC 180 

TAAATCAATC ATCGGTATTG GATAATGCGT TGCCGATGTT TTGACTTGTA TGTT 2 34 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 295 base pairs 
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(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA (genomic) 
(in) HYPOTHETICAL: NO 
!:v; ANT I SENSE: NO 

(x;; SEQUENCE DESCRIPTION: SEQ ID NO: 91 : 

AAT7CGGCCG GCTGTGTCAA ATAATGCGTT ACTTTGGCCG GGTCTTGTTC TTTGTAAGTG 60 

GTGGTCTT" TI7GCC-CGTI AICCCCAICT GI7IGAGIGC ATAGCAAATG GIGGCTGCCG 120 

TACAAICAAA TGTTTGGCGT T CAT GCAGAT AGGCATCATG GTGTTGCCCA ATATATTGAG 180 

CCGGTTTTTG CCTATCCGAT TTGACGGC AT TTAGACCGGT AACTTGATGT TTTAAGCTGC 24 C 

CTGTTTGTTT AAAGGCGAAT CC AC AAGTAA AGCGTGTTTC TTGACAGGTT AAACG 295 

(2) INFORMATION FOR SEQ ID NO: 92: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 259 base pairs 
(3) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 
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AATTGTGTA7 A7CAAGTAGG ATGGGCATTT ATGCCTGACC TACAAAACCA AAAACAACCT 60 

ACCACCCTTA A7CAACTCCA CAAACCCTCT TCAGACAACC TCGTTTTTTG AAAAACAATC 120 

TGTAAACAGA TAACTGC7GA AGAATACCGT TGCCGAGCCC CAAAACCCGT ACTGCAACTT 180 

T 7 AT T GT GAA CTTCCCATTA 7GAGAAAA7C CCTTTTCGTC C7C777CTG7 A77CG7CCCT 24 0 




(— i) MC_^C7_— _Y?^: 7NA { genome) 
(iv) AN7I SENSE: NO 

(xi) SEQUENCE 7ESCRIP7ICN: SEQ ID NO: 93: 

AATTGCACCA CGCGATGATG GG7ACGCCTC TGTTGCCATT GCGACCGCCG CCGCCGTGCC 60 

CGG7ACGCTG GTCAACCTTG CCGCGGCGGA ACGGGTAAAG AAGTGCGCTT CGGGCATCCT 120 

TCCGG7ACA7 TGCGCGTCGG TGCAGCGCCG AATGTCAGGA CGGACAATGG ACGGCCACCA 180 

AAGCGGTTA7 GAGCC GC AGC GCACGCGTGA TGATGGAAGG TTGGGTCAGG GTGCCGGAAG 240 

ATTGTTTT7A AATTGGACGG CGAACCGGTC TATTCGTATT GGCGTTATAC CGCCGCAAAG 300 

GCAGACCTTG AAACTGG7 GC GTGCCGTGCA GGGCATGTAC GGCTATGTGT GCGTGGCGGG 360 
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CGGATTTGAT GTGCGGAAT 379 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 303 base pairs 
(3) TYPE: nucleotide 
(C) STRANDEDNESS: single 
[Z) TOPOLOGY: linear 

MCLECTLE TYPE: DNA (genome) 

(ill) HYPOTHETICAL: NO 



W !;v, ANTISEX3E: NO 

IIJ 

M ;x:) SEQTIXCI EE SCRIPT I ON: SEQ ID NO: 94: 

4= 

||| AATTTGTTGG GCAGATGGCC GTGAATCAGC AGGTGGGCGA CTTCTTCAAA CTCGCATTTT 60 



5 

o 

£ 

ru 



T GT GC C AAAT CCAGAATGTC GTAACCGCGA TACGTCAAAT CGTTGCCGGT ACGCAACGGT 1 20 

ACACAAAGCG GTATTACCGG CCGCAACGCC AGAAAGCGCA ACGGATTTTT AGGTTTGAGG 180 

GTCGGGGTTT GAGTAGTTTC AGTCATGGTA TTTCTCCTTT GTGTTTTTAT GGGTTTCGGG 240 

TTTTCAGACG AC C GAT GC GG ATTTGTTGAA AGGCAGTCTG AAAGCGGTAA ATCATTTTTG 300 

AAACAATT 308 
(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 286 base paxrs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I SENSE: NO 

(xi; SEQUENCE CESCRIPTICN: SEQ ID NO: 95: 
AATTCGGAGG AGCAGTACCG CCAAGCGTTG CTCGCCTATT CCGGCGGTGA TAAAACAGAC 
GAGGGTATCC GCCTGATGCA ACAGAGCGAT TACGGCAACT TGTCCTACCA CATCCGTAAT 
AAAAACAT GC TTTTCATTTT TTCGGCAAGC AATGACGCAC AAGCTCAGCC CAACACAACT 

cACC^ _ A — o- w T--_ A _ cr AAJ^AAGACGC T CAAAAAGGC ATT AT CACAG T T GCAGGC GT 

n^A. w ^ ^>_.-.G _ c w^-. ^r--_-_j-_r-. G _ _ C A--l _ GGC _ C C AAC C AT T GC GGAAT T 
[2] INESRVATISN TCP. SEQ IT NC : 96: 

(A) LENGTH: 233 base pairs 

(3) TYPE: nucleotide 

(C) STRANDEDNESS: Single 

O; TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
AATTTGGATA CGTTGGAAAA GGGATATTTG ATTGGGAATG GGATGAAGAT AAGCGTAGAT 
GAGTT GGGGA AAAAAGT GT T AGAACATATC GGTAAGAATG AACCGTTATT GTTGAAAAAT 
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CTACTGGTTA ACTTCAATCA GGGAAAACAT GAAGAAGTTA GGAAGTTGAT TTATCAGTTG 180 

A Z A G A G 7 T A G AXTTTCTGGA ACTTTTGTGA GGGATTCTAT GAAAAACTGG AAGCAATT 23 8 

[2} INFORMATION FOR SEQ ID NO: 97: 

(_) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 322 base pairs 
[3} TYPE: nucleotide 
(C; ST RAND EDNESS : single 
(D; TOPOLOGY: 1: 



(n; MC_ECT1E TYRE: DNA (genomic) 



:-:yr: 



;iv; ANT I SENSE : NC 

:x:; SZQ'SZ^ZZ DESCRIRTICN: SEQ ID NO: 97: 

AATTCGGCAC GCAGGTTTTC TAAAAAAAGG CCGTTGATGA CTTTGTCGAT ATTGGCGGCT 6C 

TCGGTGTAGT GCGCGCCCGC TT C GGCCGCT CTTGCGCGTC CATGACGGAT TGGAAGAGCG 120 

TGCCGAAGAT TTCTGGACTG AT GTTGCGCC AGTCGAAATT GCCGACACGG GAGGAATACC 180 

TGCCAACAAG AGTGCAGGCA GCGTAATCAA ACCACCCCCA CCCGCAATCG CATCGATAAA 240 

TGCGGCAATC AT CGC AACCA AACCCAAAGC GAGTATTATG TATAAATCTT CCATGTTTCT 300 

TAATCCT GET AACTTGCACC AA 32 2 

[2] INFORMATION FOR SEQ ID NO: 98: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 316 base pairs 

(3) TYPE: nucleotide 

(C) STRANDEDNESS : single 
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CD) TOPOLOGY: linear 
(u) MOLECULE TYPE: DNA (genomic) 
(ni) HYPOTHETICAL: NO 
Civ) ANT I SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98 
AATT7GTCGG CAATCTTCCC C-GGTCGCTTT ATTTTGTGCA GGCATTATTT TTCA777' 
GCTTGACAGT TTGGA GAT AT TGTGTATCGG GGGGGGGTAT TTGCTGACGT AAAAAAC 

-~- ~— ~ v-c^CG^A G C G GC_ GAG - A jl A. -TA^TGAC-T TGA 'iTG^C / Zi'3'Z3'Z^: 

A T G G A 7" AAAA TCGCCAGCGA TAAAGA77TG CGAGAACCTG ATGCCGGCCT GTTG77G 
A77TTCGACC 7G7AA7IACG A7TTGGCTTC CGCGCCGGCA CAATATGCCG CCAAGCG 
CCCACATTTT GGAAGC 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 217 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
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AATTCGGACA GTATGAATAC AGCGGATTAA TACAAGGTAA GTTCATTACA ACGGAAAAAC 60 

CTTTAAAGAA TAATATGAAA GGTATTACCT TGTTTGCCAA CGGGAAT GGT AAATATGCCC 1 20 

GAGTTTTTCA CTGAATAGCG AATCCAGCCA TTTCTATTCA TATTTGACTG GATGGCTGAA 1 80 

IGTGGACTTT ATAGA7AA7G ACGATGAAGA TTTAATT 217 
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CLAIMS 

1/ DNAs, characterized in that they are in all or part 
genes, with their reading frame, present in Neisseria 
5 meningitidis (called Nm below) , but absent both from Neisseria 
gonorrhoeae (called Ng below) and from Neisseria Pactamica 
[sic] (called Nl below) , with the exception of genes involved 
in the biosynthesis of the polysaccharide capsule, frpA, frpC, 
opc, pcrA, rotamase, the sequence IC1106 [sic], IgA proteases, 
10 pilin, pilC, proteins which bind transferrin and opacity 
proteins . 

2/ DNAs according to claim 1 , characterized in that they 
are present in Nm, but absent from Ng. 

3/ DNAs according to claim 2, characterized in that they 
15 comprise one or more sequence (s) present on the chromosome of 
Nm Z2491 between tufK and pilT, or region 1 of the chromosome, 
and/or the nucleotide sequence (s) capable of hybridizing with 
the said sequence (s) . 

4/ DNAs according to claim 2, characterized in that they 
20 comprise one or more sequence (s) present on the chromosome of 
Nm Z2491 between pilQ and >*740, or region 2 of the chromosome, 
and/or the nucleotide sequence (s) capable of hybridizing with 
the said sequence (s). 

5/ DNAs according to claim 2, characterized in that they 
25 comprise one or more sequence (s) present on the chromosome of 
Nm Z2491 between argF and opaB, or region 3 of the chromosome, 
and/or the nucleotide sequence (s) capable of hybridizing with 
the said sequence (s). 

6/ DNAs according to claim 3, characterized in that their 
30 sequence corresponds in all or part to SEQ ID No. 9, 13, 22 or 
30, and/or to any sequence located at more or less 20 kb from 
these SEQ ID on the chromosome of an Nm strain, and/or is 
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capable of hybridizing with at least a fragment of any one of 
these sequences. 

7/ DNAs according to claim 4, characterized in that their 
sequence corresponds in all or part to SEQ ID No . 1 , 2, 4, 6, 
5 7, 10, 15, 31 or 34, and/or to any sequence located at more or 
less 20 kb from these SEQ ID on the chromosome of an Nm 
strain, and/ or is capable of hybridizing with at least a 
fragment of any one of these sequences. 

8/ DNAs according to claim 4, characterized in that they 

10 are all or part of the DNA sequence SEQ ID No. 36 or sequences 
corresponding to the open reading frames SEQ ID No. 37, SEQ ID 
No. 38, SEQ ID No. 39, SEQ ID No. 40, SEQ ID No. 41, SEQ ID 
No. 42, SEQ ID No. 43, SEQ ID No. 44, SEQ ID No. 45 and/or to 
any sequence located at more or less 20 kb from these SEQ ID 

15 on the chromosome of an Nm strain, and/or is [sic] capable of 
hybridizing with at least a fragment of any one of these 
sequences . 

9/ DNAs according to claim 5, characterized in that their 
sequence corresponds in all or part to SEQ ID No. 8, 21, 23, 

20 25, 26, 28, 29, 32 or 35, and/or to any sequence located at 
more or less 20 kb from these SEQ ID on the chromosome of an 
Nm strain, and/or is capable of hybridizing with at least a 
fragment of any one of these sequences. 

10/ DNAs according to claim 2, characterized in that 

25 their sequence corresponds in all or part to SEQ ID No. 3, 5, 
1 1, 1 2, 14, 1 6, 18, 1 9, 20, 24, 27 or 33, and/or to any 
sequence located at mere or less 20 kb from these SEQ ID on 
the chromosome of an Nm strain, and/or is capable of 
hybridizing with at least a fragment of any one of these 

30 sequences . 

11/ DNAs according to claim 1 , characterized in that they 
are common with those of Ng, but are absent from Nl. 
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12/ DNAs according to claim 11, characterized in that 
they comprise one or more sequence (s) present on the 
chromosome of Nm Z2491 between arg J and reg F, or region 4 of 
the chromosome, and/or the nucleotide sequence (s) capable of 
5 hybridizing with the said sequence (s) . 

13/ DNAs according to claim 11, characterized in that 
they comprise one or more sequence (s) present on the 
chromosome of Nm Z2491 between the marker lambda 375 to pen A, 
or region 5 of the chromosome, and/or the nucleotide 
10 sequence(s) capable of hybridizing with the said sequence(s). 

14/ DNA according to any one of the preceding claims, 
characterized in that it codes for a protein exported beyond 
the cytoplasmic membrane. 

15/ DNAs according to any only of claims 1 to 14, 
1 5 characterized in that all or part of their sequence 
corresponds to a region conserved within the Nm species . 

16/ DNA according to any one of claims 1 to 15, 
characterized in that it is inserted in a transfer or 
expression vector, such as a cosmid, plasmid or bacteriophage. 
20 17/ Host cell, more particularly bacterial cell or Nm 

cell, transformed by insertion of at least one DNA according 
to any one of claims 1 to 15. 

18/ Cell comprising genes or gene fragments specific to 
Nm, more particularly bacterial cell or Nm cell, the 
25 chromosome of which is deleted by at least one DNA according 
to any one of claims 1 to15, in particular a DNA responsible 
for the pathogenicity. 

19/ DNAs, characterized in that their sequence 
corresponds in all or part to the transcription of at least 
30 one DNA sequence or sequence fragment according to any one of 
claims 1 to 15. 

20/ Antisense nucleic acids, characterized in that their 
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sequence corresponds to the antisense of at least one 
nucleotide sequence according to any one of claims 1 to 1 5 or 
19, or a fragment of such a sequence, and in that they carry, 
where appropriate, at least one chemical substituent, such as 
5 a methyl group and/or a glycosyl group. 

21/ Polypeptides, characterized in that they have an 
ammo acid chain corresponding to all or part of a sequence 
coded by the nucleic acids defined in any one of claims 1 to 
15 or 19, or deduced from sequences of these nucleic acids, 
10 with, where appropriate, modifications with respect to the 
coded or deduced sequences, where these modifications do not 
alter the biochemical properties observed in the natural 
polypeptide . 

22/ Peptides according to claim 21 , characterized in that 
15 they are peptides exported beyond the cytoplasmic membrane, 

more specifically peptides corresponding to all or part of 

those coded by a DNA according to claim 14. 

23/ Antibodies, characterized in that they are polyclonal 

or monoclonal antibodies directed against at least one epitope 
20 of a peptide according to claim 20 or 21 , or fragments of 

these antibodies, more particularly fragments Fv, Fab, Fab 1 2, 

or also anti-antibodies capable of recognizing, by a reaction 

of the antigen-antibody type, the said antibodies or their 

fragments . 

25 24/ Process for obtaining Neisseria meningitidis- specific 

DNA banks, comprising 

- mixing of two DNA populations, 

- realization of at least one subtractive hybridization- 
amplification iteration, and 

30 - collection of the desired DNA or DNAs, followed, where 

appropriate, by its/ their purification with elimination of 
redundant sequences . 
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25/ Process according to claim 24, characterized in that, 
to obtain a bank which is specific to Nm, in contrast to Ng 

two DNA populations originating respectively from a 
strain of Neisseria meningi tidis , or a reference strain , for 
5 which the specific bank is to be constructed, and a strain of 
Neisseria gonorrhoeae , or a subtraction strain, the DNA 
sequences of these strains being those obtained by 

random shearing of the chromosomal DNA of the 
subtraction strain, in particular by repeated passage through 
' C a syringe, and 

cleavage of the chromosomal DNA of the reference 
strain, preferably by a restriction enzyme producing fragments 
less than about 1 kb in size, and in that to obtain a bank of 
C^TAs corcon between Nm and Ng, but specific with respect to 
15 Nl, three different banks are constructed, two of them by 
digestion of the chromosomal DNA of Nm by Mbol and Tsp5091 , 
and the third by digestion of the chromosomal DNA of Nm with 
MspZ , two subtraction series are carried out, and the DNAs 
having the required specificity are collected. 
20 26/ Banks of DNA clones obtained by carrying out the 

process according to claim 24 or 25. 

27/ Use of the process according to claim 24 to obtain 
banks of DNAs specific to a given cell or to a given variant 
of the same species of cell, where another species or another 
25 variant which is close genomically and expresses different 
pathogenic potencies exists, in particular banks of DNAs 
specific to cryptococci, Haemophilus, pneumococci or also 
Escherichia . 

28/ Method for diagnosis of a meningococcal infection, 
30 and more particularly of meningococcal meningitis, by 
demonstration of the presence of Neisseria meningitis in a 
biological sample, characterized in that it comprises the 
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stages of: 

bringing into contact a biological sample to be 
analysed and a reagent formulated from at least one nucleic 
acid as defined in one of claims 1 to 15 or 19, if appropriate 
in the form of a nucleotide probe or a primer, or, as a 
variant, from at least one antibody or a fragment of an 
antibody, as defined in claim 23, under conditions which allow 
respectively hybridization or a reaction of the antigen- 
antibody type, and 

- detection of any reaction product formed. 

23/ Method for diagnosis of an immune reaction specific 
to meningococcal infection, characterized in that it comprises 
the stages of : 

bringing into contact a biological sample to be 
analysed and at least one polypeptide according to any one of 
claims 2 ' or 22 or an anti-antibody according to claim 23, or 
a fragment thereof, these products being labelled, where 
appropriate, under conditions which allow a reaction of the 
antigen-antibody type to be effected, and 

- detection of any reaction product formed. 

30/ Kits for carrying out a method according to any one 
of claims 28 or 29, characterized in that they comprise 

- at least one reagent as defined in claim 28 or 29, that 
is to say of the nucleic acid, antibody or peptide type, 

products, in particular markers or buffers, which 
enable the intended nucleotide hybridization reaction or 
immunological reaction to be carried out, as well as use 
instructions . 

31/ Vaccine composition including in its spectrum, in 
particular in combination with at least one childhood vaccine, 
antimeningococcal prophylaxis and intended for prevention of 
any form of infection by Neisseria meningitidis, characterized 
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in that it comprises, in combination with (a) physiologically 
acceptable vehicle (s) , an effective amount: 

- of peptide according to claim 21 or 22, or 

of antibody or anti-antibody fragment according to 
claim 23, 

this material optionally being conjugated, in order to 
reinforce its immunogenicity , with a carrier molecule such as 
a poliovirus protein, tetanus toxin, protein produced by the 
hypervariable region of a pilin. 

32/ Vaccine composition including in its spectrum, in 
particular in combination with at least one childhood vaccine, 
antimeningococcal prophylaxis and intended for prevention of 
any form of infection by Neisseria meningitidis , characterized 
in that it comprises, in combination with (a) physiologically 
acceptable vehicle (s) , an effective amount: 

- of nucleic acids according to any one of claims 1 to 1 5 
or 1 9 or 

- of ceils according to claim 17 or 18. 



REPLACEMENT SHEET (RULE 26) 



1/9 



09/2 



us lit i in i i 




o 

T3 

O 



11 in t 



itii 1 



?5 jii i nn 1 




o 

<5 



t it 
f 



15 Hi 1 Hit i 



I I t I 



^ to 

r- ^> 



i i 



1 1« 



\i ill 1 jii^i 

^cv^Vdd 0000*0 



REPLACEMENT SHEET (RULE 26) 



2/9 

Figure 2 
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Figure 8B 
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was filed on as U. S. Application Serial No. 
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U~! be true; and further that these statements were made with the knowledge that willful false statements and the like so made are punishable by fine or 
% fl imprisonment, or both, under Section 1001 of Title 18 of the United States Code and that such willful false statements may jeopardize the validity of 
7 the application or any patent issued thereon. And I hereby appoint NIXON & VANDERHYE P.C, 1100 North Glebe Rd.. fith FlnnrjiH^nnva 
22201^714 1 Jelephone number (703) 816-4000 (to whom ail communications are to be directed), and the following attorneys thereof (of the 
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Y0 connected therewith and with the resulting patent: Arthur R. Crawford, 25327: L arry S. Nixon, 25640^Robert A. Vanderhye, 27076; James T. Hosmer, 
ri§301B*-Robert W. Faris.^SSSr-Richard G. Besha, 22770: Mark E. Nusbaum, 3234SLMichaeI J. Keenan, 321 06; Bryan H. Davidson, .30251; Stanley ' 
LT 5 C. Spooner,2Z393^Leonard C. Mitchard, 22009; Duane M. Byers, 3336aLPaul j. Henon, 33626^Jeffry H. Nelson, 30481; John R. Lastov031 49; H. 
IU Warren Bumam, Jr.,^93££^Thomas E. Byrne, 3 2205; M ary J. Wilson, 32955; J. Scott Davidson. 33489; Je rry D. Craig, 38Q26. ' ' 
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